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Abstract 

The experimental status of measurements of the top quark mass is reviewed. After 
an introduction to the definition of the top quark mass and the production and decay of 
top quarks, an in-depth comparison of the analysis techniques used in top quark mass 
measurements is presented, and the systematic uncertainties on the top quark mass are 
discussed in detail. This allows the reader to understand the experimental issues in the 
measurements, their limitations, and potential future improvements, and to comprehend 
the inputs to and formation of the current world average value of the top quark mass. 
Its interpretation within the frameworks of the Standard Model and of models beyond 
it are presented. Finally, future prospects for measurements of the top quark mass and 
their impact on our understanding of particle physics are outlined. 
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1 Introduction 

The top quark is the heaviest known elementary particle. While it has not yet been 
possible to answer the question why its mass is so large, the precise measurements 
of the top quark mass that have become available since its discovery have already 
greatly improved constraints on our picture of nature; for example they have made 
predictions of the mass of the as yet undiscovered Higgs boson possible. This report 
first defines the top quark mass and then describes in detail the techniques used to 
measure it. This is followed by a description of the systematic uncertainties. The 
current world average value of the top quark mass is presented, and the constraints it 
provides on elementary particle physics models are shown. Finally, potential future 
improvements of the precision on the top quark mass are outlined. 

Of all known elementary fermions, the top quark has by far the largest mass. This renders 
the top quark unique from a theoretical standpoint: The top quark Yukawa coupling is close 
to unity, which may be a hint that the top quark mass is related with electroweak symmetry 
breaking. Via loop contributions, the masses of the W boson, the top quark, and the yet 
undiscovered Higgs boson are interrelated so that the Higgs mass (which is not predicted in the 
Standard Model of elementary particle physics) may be constrained from precise measurements 
of the W boson and top quark masses [H |2J . Experimentally, on the other hand, the top quark 
is unique as it is the only quark that does not hadronize because its lifetime is too short [3]; 
it is therefore possible to directly measure the properties of the quark instead of a hadron 
containing the quark of interest. 

Long before the discovery of the top quark, its existence as the up-type partner of the 
bottom quark had been postulated within the Standard Model, and its mass could be predicted 
from precision measurements of electroweak observables. Currently, indirect constraints within 
the Standard Model yield a top quark mass value of m t = 178 + ig GeV [2^. The top quark 
was finally discovered jl] by the CDF and DO experiments in proton-antiproton collisions at 
the Fermilab Tevatron Collider. Since then, measurements of the top quark mass have been 
performed both during Run I of the Tevatron in the 1990s at a proton-antiproton center-of- 
mass energy of y/s = 1.8 TeV [5] and during the ongoing Run II at an increased center-of- 
mass energy of 1.96 TeV and with larger data sets [HI El IE]- Their average value of m t = 
171.4 ± 2.1 GeV [9] is in striking agreement with the indirect prediction, thus supporting the 
Standard Model as the theory of nature. Innovative measurement techniques have made this 
precision possible, which already surpasses the original expectations for Tevatron Run II [TO] . 
Within the Standard Model, a value of the Higgs boson mass close to the current lower 
exclusion limit is favored [2J. 

The Tevatron experiments have performed many more measurements of top quarks. The 
total cross section for top-antitop pair production [TT1 [T2] is consistent with the predictions 
from QCD [15] . using the above top quark mass as input. No evidence for effects beyond those 
predicted in the Standard Model has been found in production and decay of top quarks [TU 
[151 HE]- A recent review of top quark measurements can be found in [IT] . 

throughout this report, the convention H = 1, c = 1 is followed. Charge conjugate processes are included 
implicitly. Top quark masses quoted are pole masses unless noted otherwise - see Section 12.11 for a definition 
of the pole mass. 
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1 INTRODUCTION 



To date, the Tevatron Collider still provides the only possibility to produce top quarks. 
In the near future, the LHC proton-proton collider will start operation, which is expected to 
provide much larger samples of top quark events. While the measurement of the top quark 
mass will be subject to very similar systematic uncertainties, it can be assumed that the 
large data samples will allow for a further reduction of the error. However, only a linear e + e~ 
collider scanning the ti production threshold will allow for an order of magnitude improvement 
of the precision. 

This paper provides an overview of current measurements of the top quark mass at hadron 
colliders, focusing on the Tevatron Run II results. The purpose of this document is twofold: 

• to review the current status of top quark mass measurements, compare the assumptions 
made in the various analyses, discuss the limiting systematic uncertainties together 
with potential future improvements, and to give an overview of the interpretation of the 
measurements; and 

• to provide a detailed description of the measurement techniques developed and used so 
far for the measurement of the top quark mass, not only to complement the information 
on the physics results, but also as a reference for the development of future measurements 
(of the top quark or other particles). 

The general structure of the paper is as follows: Section [2] gives a brief summary of 
definitions of the top quark mass and discusses the relevance of measurements of the top quark 
mass for elementary particle physics. Section [3] then outlines the production mechanisms for 
top quarks at hadron colliders and the event characteristics. The steps needed to obtain a 
set of data events with which to measure the top quark mass are described in Sections H] 
(reconstruction of top quark events) and [5] (detector calibration). 

An overview of the different techniques (template, Matrix Element, and Ideogram meth- 
ods) to determine the top quark mass from such a set of calibrated data events is given in 
Section [6j The principle of template based measurements and examples using different event 
topologies are discussed in Section [71 Section [8] gives an in-depth description of the Matrix 
Element method, and the Ideogram method is described in Section [9j The fitting procedure 
to determine the top quark mass is discussed in Section [JDJ 

The current world average of the top quark mass is already dominated by systematic 
uncertainties. The different sources of systematic uncertainties and the estimation of the size 
of the corresponding effects are discussed in Section [TTJ Section [12] then summarizes the 
current knowledge of the top quark mass and the interpretation of these results and outlines 
possible future developments. Section [T51 summarizes and concludes the paper. 
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2 Definition and Relevance of the Top Quark Mass 

The definition of the mass of a particle may seem trivial. However, when used in 
conjunction with a quark it is in fact by no means obvious how "mass" should best 
be defined. This section introduces different possible definitions and states in general 
terms which kind of measurement determines which mass. The section then outlines 
how the precise knowledge of the top quark mass improves our understanding of 
elementary particles and the description of their interactions within the Standard 
Model of particle physics. 



2.1 Definitions of the Top Quark Mass and Measurement Concepts 

In general, "the" mass m of a particle is only defined within a theory or model in which it 
occurs as a parameter. The mass of a particle can then be determined through a comparison 
of measurements with the predictions of the theory (the validity of the mass value obtained 
is then restricted to this particular theory). While it is straightforward to find a suitable 
definition of the mass of a color-neutral particle, there are several possibilities for defining 
the mass of a (color-charged) quark. This section illustrates the underlying concepts and 
defines how the word mass is used in conjunction with the top quark in the remainder of this 
report. See Reference [3] for more detailed reviews of Quantum Chromodynamics (QCD), 
quark masses, and top quark physics. 

For each quark, a mass parameter is introduced in the QCD Lagrangian. (In the Standard 
Model, the value of this parameter is proportional to the Yukawa coupling of the quark to 
the Higgs boson.) The value depends on the renormalization scheme and the renormalization 
scale /i. At high energies, the QCD coupling constant a s is small, and observables are typically 
calculated in perturbation theory, commonly applying the MS renormalization scheme. (The 
MS scheme is used by the Particle Data Group to report all quark masses except the top 
quark mass.) 

For an observable (i.e., non-colored) particle, the position of the pole in the propagator 
defines the mass. In perturbative QCD, this pole mass can also be used as a definition of 
quark masses. However, the pole mass cannot be used to arbitrarily high accuracy: Because 
of confinement (i.e., because of non-perturbative effects in QCD), the full quark propagator 
does not have a pole. This is true even for the top quark which does not hadronize before 
decaying. The general argument is presented in a very intuitive way in Reference [IB] . The 
relation between the pole mass and MS mass is known to three loops, see [3] and references 
therein, but there necessarily remains an uncertainty of order Aqcd i n the pole mass [IS] . 

Different definitions of the pole mass are used. An unstable particle can generally be 
described by a Breit-Wigner resonance [19] 



/00~- — — (i) 

7T (s — m 2 ) 2 + (mr) 2 

where s = p 2 is the squared four-momentum of one particle, and the properties of the res- 
onance are described by a constant width T and the corresponding (pole) mass rh. It is 
possible to absorb higher-order corrections into the pole mass definition. For example, for the 
experimental determination of the Z boson mass an s-dependent width is used to describe the 
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resonance, with the term (mf) replaced by (sT/m). To accomodate the same experimental 
data, different numerical values of the mass parameter are needed in the two approaches; for 
the Z boson the relation between the two parameter values is given by [2] 




m z = rhz\ l + «% + 34.20 MeV . (2) 
V m z 

Similarly, different definitions are possible for the top quark mass. Measurements of the 
top quark mass at a hadron collider rely on comparisons of the data with simulated events, 
and thus it is important to state the definition adopted in the simulation which is used in the 
measurement. The two simulation programs used most commonly in current measurements 
are alpgen [20J, which uses fixed widths in propagators, and pythia [19J, where a factor 
(1 — 2.5a s (s)/7r) is included for top quarks to approximate loop corrections. The energy 
dependence of a s in principle introduces a difference between the two definitions; this is 
however negligible compared to the intrinsic uncertainty of order Aqcd- 

To determine the top quark mass defined in any given scheme, one has to find observables 
measurements of which can be compared to theory predictions which in turn depend on this 
top quark mass. In practice, there are three fundamentally different approaches: 

• Indirect constraints from electroweak measurements: Even before the first direct 
observation of top quarks, indirect constraints were obtained from fits of the Standard 
Model prediction as a function of the top quark mass to precision measurements of 
electroweak observables [U |2] . This method of course has the drawback that it is not 
an actual discovery of the top quark, and that the mass value is only valid within the 
Standard Model (or in other theories whose predictions do not significantly differ from 
those of the Standard Model). 

• Reconstruction of top quark decay products: Today and in the near future, top 
quarks are and will be produced at the hadron colliders Tevatron and LHC, allowing 
for a direct measurement of the top quark mass from the reconstructed decay products. 
The momenta of the decay products are related according to 

mS? = Pt{i) 2 = ^1>WJ » (3) 

where p denotes the 4-momentum of a particle and the sum is over all decay products 
j of the top quark t in a specific event i. A measurement based on the momenta of the 
decay products thus ideally corresponds to a measurement of the pole mass since the 
squared sum of four-momenta as given in Equation ([3]) enters in the denominator 

p\ - m\ + im t T t (4) 

of the propagator term. Individual measurements differ in how an observable that is 
related with the top quark mass is constructed from the measured decay products, and 
the situation is more complicated for measurements relying on complex techniques like 
the Matrix Element or Ideogram methods discussed in Sections [S] and M In the most 
precise measurements in the £+jets channel, the experimental information comes to a 
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very large extent from the invariant mass of the reconstructed top quark decay products; 
thus the measured value can be expected to correspond (most closely) to the pole mass, 
but this issue has not yet been studied in detail. 

In contrast to the other quarks (up, down, charm, strange, and bottom), the top quark 
decays before forming hadrons [3]. This makes a direct measurement of the top quark 
mass (instead of a hadron mass) possible; hadronization only affects the decay products 
of the top quark and leads to jet formation, cf. Section 13.21 

Top quark mass measurements based on the decay products are valid not only within 
the Standard Model but in any model which does not introduce significant changes to 
those features of top quark production and decay that are used in the measurement. 
However, the results are subject to an intrinsic uncertainty of order Aqcd as mentioned 
above. 

• ti threshold scan: In the long-term future, it will be desirable to determine the 
top quark mass based on a definition that is not subject to the uncertainty on the pole 
mass, even though the current combined experimental uncertainty is almost a magnitude 
larger. The best-known example is the measurement of the cross section for top-antitop 
pair production near threshold at a future e + e~ collider. This experimentally very clean 
measurement could be related to theory predictions that are calculated as a function of 
a top quark mass parameter that can be translated into the MS mass with much smaller 
uncertainty [2Tj . The principle of the measurement is analogous to the determination 
of the W boson mass from the measurement of the WW production cross section at 
threshold at LEP2. 

This report focuses on the techniques, current results, and prospects of top quark mass 
measurements at the Tevatron, where the mass is reconstructed from the properties of the 
decay products. Consequently, the pole mass definition is implicitly assumed throughout the 
remainder of this report unless noted otherwise. This is consistent with the conventions of 
the Tevatron Electroweak Working Group [9] and the Particle Data Group [3j- 

2.2 Relevance of the Top Quark Mass within the Standard Model 

In perturbation theory, predictions for observables receive contributions from loop diagrams, 
where particles contribute even if they are too massive to be produced on shell. The size 
of these corrections to leading-order predictions depends on the values of the masses of the 
particles in the loops. Of particular importance for Standard Model fits is the dependence 
of the W boson mass on the top quark and Higgs boson masses. The lowest-order diagram 
leading to the dependence on the top quark mass is shown in Figure QJa), those resulting in 
the Higgs mass dependence in Figures [H(b) and (c). The corrections that arise from these 
diagrams are quadratic in the top quark mass, but only logarithmic in the Higgs boson mass 
(yielding a much weaker dependence). 

Since the dependence on the Higgs boson mass is weak, measurements of the W mass 
(and of other electroweak observables) lead to indirect constraints on the top quark mass. 
This led to predictions of the mass of the top quark before its actual discovery, as already 
outlined in Section |2"TT1 Also, precise measurements of both the W boson and top quark masses 
result in constraints on the Standard Model Higgs boson mass. In the following sections, the 
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Figure 1: Feynman diagrams of loop processes that lead to a dependence of the W boson 
propagator on (a) the top quark mass and (b, c) the Higgs boson mass. 



experimental measurements of the top quark mass are discussed in detail. The interpretation 
of the current results within the Standard Model (and models beyond the Standard Model) 
is then further discussed in Section 112.21 
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3 Top Quark Production and Decay at Hadron Colliders 

Top quarks can be studied best when produced on shell in a collider experiment. This 
is currently only possible at the Fermilab Tevatron proton- antiproton collider near 
Chicago. In the near future, the LHC proton-proton collider at CERN near Geneva 
will produce large numbers of top quarks. This section describes the properties of 
events produced in reactions involving top quark decays. 

In this section, the mechanisms for top quark production in hadron collisions {pp or pp) 
are described. Events containing a ti pair are used to measure the top quark mass, and thus 
the different topologies of these events, which depend on the top quark decays, are discussed. 
The relevant background processes are also described. 

3.1 Top Quark Production 

Because of the large top quark mass, high energies are required to produce top quarks, and 
the production processes (including those proceeding via the strong interaction) can be de- 
scribed in perturbation theory. The internal structure of the colliding hadrons is resolved, 
and top quarks are thus produced in a hard-scattering process of two constituent partons 
(quarks/ ant iquarks or gluons) inside the hadrons. The description of the reaction factorizes 
into the modeling of the constituents of the incoming hadrons, of the hard-scattering process 
yielding the top quarks (and also describing their subsequent decay), and of the formation 
of the observable final-state particles. A schematic illustration of this factorization scheme is 
given in Figured 

To calculate the (differential) cross section for top quark production, a factorization scale 
\j? F is introduced to separate the hard-scattering partonic cross section from the modeling of 
the constituents of the proton/antiproton. The latter is independent of the hard-scattering 
process, and parton distribution functions (PDFs) /p DF ( introduced that describe 

the probability density to find a parton a (quark or antiquark of given flavor or gluon) with 
longitudinal momentum fraction x inside a colliding proton. The PDFs cannot be calculated, 
and are determined in fits to experimental data. As an example, the CTEQ5L parametriza- 
tion [22] is shown in Figure [3] for a scale of p F — (175 GeV) 2 (a common choice used in current 
measurements for the description of top quark production). Even though experimental ob- 
servables cannot depend on the factorization scale, the PDFs (and the hard-scattering cross 
section) depend on the value of \j? F chosen, and an overall dependence remains if calculations 
are not done to infinite order in perturbation theory. In the following sections, the depen- 
dence on the factorization scale is not mentioned explicitly, and the symbol /pdf( x ) is used. 
To assess the systematic uncertainty related to the choice of factorization scale, experiments 
compare the results of simulations based on different values for the scale. 

There are two main mechanisms for top quark production at hadron colliders: top-antitop 
pair production via the strong interaction, and single top production via the electroweak 
interaction. Single top production has only recently been observed [23J, and this process is 
not (yet) used to measure the top quark mass. Consequently, the emphasis of this section is 
on ti pair production. 

The leading-order Feynman diagrams for the hard-scattering process of ti production are 
shown in Figure HI They apply to both proton-antiproton (Tevatron) and proton-proton 
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Figure 2: Schematic drawing illustrating the concept of factorization. Shown is a collision of 
two hadrons leading to a hard- scattering process at a scale Q 2 . This hard interaction is initiated 
by two partons of momenta X\P\ and X2P2, where Pi and P2 are the momenta of the colliding 
hadrons. The partonic cross section a of the hard interaction can be calculated perturbatively, 
based on the renormalization and factorization scales n 2 R and fi 2 F . The factorization scale is 
also used to evaluate the parton distribution functions /pdfj which parametrize the probabilities 
to find the partons a\ and 02 inside the colliding hadrons. If the hard interaction involves the 
production of top quarks, their decays are included in its description, since the top quark 
lifetime is so short that no top hadrons are formed. The observable final-state particles are 
then formed in a hadronization process which again cannot be calculated perturbatively, but is 
independent of the hard interaction. 




Figure 3: The CTEQ5L parametrization JEEl of the distribution functions for different parton 
species in the proton as a function of the momentum fraction x of the proton carried by the 
parton, for a factorization scale [j? F = (175 GeV) 2 . 



3.1 Top Quark Production 
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Figure 4: Leading- order Feynman diagrams of the hard- scattering processes that lead to tt 
production at a hadron collider: (a) qq — >■ tt, (b) gg —> tt. 



(LHC) collisions. When contributions from higher-order diagrams are included, renormal- 
ization of divergent quantities becomes necessary. This leads to the introduction of another 
scale, the renormalization scale fi R . In practice, the factorization and renormalization scales 
are often chosen to be equal. 

To obtain the tt production cross section in hadron collisions, the partonic cross section 
a must be folded with the appropriate parton distribution functions fp DF (x), integrated over 
all possible initial-state parton momenta, and then summed over all contributing initial-state 
parton species: 



a(P 1: P 2 ) / dx 1 dx 2 fp 1 DFl (aci, /4) fp D F 2 ( x 2, /4) & ( x i p i, X 2 P 2, a s (/4) 

ai,a 2 ^ 



(5) 



where Pi and P 2 are the momenta of the incoming hadrons, the sum is over all possible 
combinations of parton species a\ and a 2 that can initiate the hard interaction, and the hard- 
scattering cross section a depends on their momenta, the factorization scale, and the ratio 
of the scale Q 2 of the hard interaction and the renormalization scale. Resulting Standard 
Model predictions for the tt production cross section at the Tevatron and LHC are listed 
in Table [TJ At the Tevatron, in proton-antiproton collisions at ^Js = 1.96 TeV, the quark- 
antiquark induced process dominates. At the LHC, in proton-proton collisions at a/s = 
14 TeV, the fraction x of the proton momentum carried by the colliding partons may be much 
smaller. Because the gluon PDF is much larger at small x than the quark PDFs, the gluon 
induced process dominates at the LHC. The overall tt cross section at the LHC is two orders 
of magnitude larger than that at the Tevatron. 

Production of single top quarks via the electroweak interaction is expected to proceed 
via three different channels. Predictions for the Standard Model cross sections are given in 
Table [TJ and Figure shows the leading-order diagrams for the three processes. The remainder 
of this report focuses on tt pair production. 
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Channel 


Tevatron Run II: 
pp collisions, 
= 1.96 TeV 


LHC: 
pp collisions, 
Vs = 14 TeV 


ti pair production 


5.8 - 7.4 pb p3] 


830 tlo pb [25 


single top, s-channel 
single antitop, s-channel 


0.98 ± 0.04 pb [251 


7.2 + pb [26] 
4.0 t g-j pb [26] 


single top, t-channel 
single antitop, t-channel 


2.2 ±0.1 pb [25] 


146 ± 5 pb [26] 
89 ± 4 pb [22] 


single top+antitop, W+t production 


0.26 ± 0.06 pb [25] 


82 ± 8 pb [26] 



Table 1: Predicted top quark production cross sections for various processes at the Tevatron 
and LHC. The predictions are at next-to-leading order, including threshold corrections from 
soft gluons. All values are quoted for an assumed top quark mass of 175 GeV. For the depen- 
dence of the cross sections on the top quark mass hypothesis see Figure (ti production) and 
References Wh\ \2b ^ (single top/antitop production). The range of ti cross sections quoted for 
the Tevatron includes PDF uncertainties ( which have been found to be dominant by studying 
the variations of the CTEQ6 JIT7| / and MRST W§ parametrizations) while the LHC uncertainty 
is only based on a variation of the renormalization scale. At the Tevatron, the relative con- 
tributions of qq and gg induced process are roughly 85% and 15%; at the LHC these numbers 
are about 10% and 90%, respectively. Wherever the cross sections for single top and antitop 
production are equal, the sum of the cross sections for both processes is listed. This is the case 
for single top/antitop production at the Tevatron because it is a proton- antiproton collider. 
The cross sections for production of a W boson in association with a top or antitop quark 
are equal also at the LHC because the b and b PDFs are equal. The single top cross sections 
quoted are similar to the next-to-leading order values published in I2$tf . 
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Figure 5: Leading- order Feynman diagrams of the hard-scattering processes that lead to single 
top production at a hadron collider: (a) s-channel, (b) t-channel, (c) W+t associated produc- 
tion. 



3.2 Top Quark Decay and Event Topologies 

In the Standard Model, top quarks decay almost exclusively to a b quark and a W boson [3j [30] , 
and the top quark decay width being much larger than Aqcd, no top quark hadronization takes 
place. Therefore, the event topology of a ti event is determined by the decays of the two W 
bosons. The b quarks and quarks from hadronic W decays hadronize and are reconstructed 
as jets in the detector. The presence of final-state neutrinos is signalled by missing transverse 
energy ]fir, defined as the magnitude of the transverse momentum vector fx needed to balance 
the event in the plane perpendicular to the beam direction. 

Commonly, the event topologies are classified as dilepton, lepton+jets (£+jets), and all-jets 
topologies. These three categories exclude events with one or more tauonic W decays, which 
are more difficult to reconstruct and provide less mass information than corresponding events 
with electronic or muonic W decays because of the additional neutrinos from r decays. In this 
report, the word "lepton" always refers to an electron or muon unless otherwise mentioned. 

In the following, the characteristics of the topologies used for top quark mass measurements 
are discussed, the main backgrounds are listed, and the consequences for measurements of the 
top quark mass are mentioned. The relative abundance of events in the various topologies is 
shown schematically in Figure [6j 

• Dilepton Events: In about 5% of ti events, both W bosons decay into an electron or a 
muon plus the corresponding neutrino. These so-called dilepton events are characterized 
by two oppositely charged isolated energetic leptons, two energetic b jets, and missing 
transverse energy due to the two neutrinos from the W decay. 
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e* e 14% 

dilepton" 




all-jets" 46% 



14% 



x+jets 14% 



it 



lepton+jets 



ii 



Figure 6: Relative abundance of the ti event topologies, calculated from the W branching 
fractions listed in Reference J2J/. The figure has been taken from I31f . and the values have been 
updated. Note the rounding errors; the total "dilepton" and u lepton+jets" branching fractions 
are about 5% and 29%, respectively. 



Because of the two charged leptons, these events are relatively easy to select. The 
largest physics background is from production of a Z boson (decaying to e + e~ or /i + /i~) 
in association with two jets. This background affects only the dielectron and dimuon 
channels and can be reduced by requiring that the invariant dilepton mass be inconsistent 
with the Z mass. Correspondingly, the e/x channel is very clean; here, the main physics 
background is from Z — > t + t~ decays where the Z boson is produced in association with 
two jets. Instrumental background where a hadronic jet with a leading 7r° — > 77 decay 
is misidentified as an isolated electron is also important at the Tevatron experiments. 

In spite of the small backgrounds the statistical information on the top quark mass 
that can be extracted per dilepton event is limited because the event kinematics is 
underconstrained when the top quark mass is treated as an unknown. The 4-momenta 
of the 6 final-state particles are fully specified by 24 quantities; the 6 masses are known, 
and the 3-momenta of four particles (the two jets and the two charged leptons) are 
measured in the detector. Additional constraints can be obtained by assuming transverse 
momentum balance of the event (2), the known masses of the W bosons (2), and by 
imposing equal top and antitop quark masses (1 constraint). This leads to 23 quantites 
that are known, measured, or can be assumed. The event kinematics could therefore only 
be solved if the value of the top quark mass itself were also assumed. Consequently, to 
measure the top quark mass, additional information is used, e.g. the relative probabilities 
for different configurations of final-state particle momenta. 

• Lepton+Jets Events: Those 29% ti events with one W — > ev or W — Y \w and one 
hadronic W boson decay are called lepton+jets events. They contain one energetic 
isolated lepton, four energetic jets (two of which are b jets), and missing transverse 
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energy. 

The main background is from events where a leptonically decaying W is produced in as- 
sociation with four jets. Multijet background where one jet mimicks an isolated electron 
also plays a role. 

In lepton+jets events, the transverse momentum components of the one neutrino can 
be obtained from the missing transverse momentum, and the event kinematics is over- 
constrained when assuming equal masses of the top and antitop quarks and invariant Iv 
and qq' masses equal to the W boson mass. The measurement of the top quark mass is 
however complicated by the fact that the association of measured jets with final-state 
quarks is not known. The number of possible combinations and also the background 
can be reduced when b jets are identified (6-tagging). 

Today, the lepton+jets topology yields the most precise top quark mass measurements. 

• All-Jets Events: In 46% of ti events both W bosons decay hadronically, yielding 6 
energetic jets, no charged leptons, and no significant missing transverse energy. 

The background from multijet production is large (and cannot easily be modeled with 
Monte Carlo generators). It can be reduced with 6-tagging information, which is also 
important to reduce combinatorics in the jet-quark assignment. 

The aim is to measure the top quark mass in all three categories in order to cross-check 
the measurements and to search for signs of effects beyond the Standard Model. The above 
picture could be changed if non-Standard Model particles with masses below the top quark 
mass exist. An example are top quark decays to a b quark and a charged Higgs boson in 
supersymmetric models: Depending on the parameters of the model, charged Higgs decays 
could alter the relative numbers of events in the different ti event topologies or lead to events 
with extra jets in the final state [30] . 
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4 Event Reconstruction and Simulation 

The previous section gave an overview of the production of top quarks at hadron col- 
liders and of the topologies of top quark events. This section describes how top quark 
events are reconstructed in the detector. It also briefly introduces the simulation of 
events. 

To measure the top quark mass, tt events must first be identified online as potentially 
interesting and saved for further analysis. The tt decay products (charged lepton(s), jets, and 
missing transverse energy from the neutrino(s)) are then reconstructed. The top quark mass 
is obtained from the energies/momenta and directions of the decay products measured in the 
detector. 

A brief overview of the CDF and DO detectors at the Tevatron is given in Section 14.11 In 
Section I4.2[ the trigger requirements used at CDF and DO for the different tt event topologies 
are presented, and Section 14.31 briefly discusses the reconstruction and selection of electrons, 
muons, and jets and the identification of h quark jets. Section 14.61 describes the simulation 
of events used to verify and calibrate the techniques for the top quark mass measurements. 
The detector calibration and the determination of the detector resolution are described in 
Section [5j 

4.1 The CDF and DO Detectors 

The CDF and DO Run II detectors are described in detail elsewhere [321 [33]. Both detectors 
have the standard cylindrical setup of a general-purpose collider detector. From the interaction 
region in the center of the detector, particles first traverse the tracking detector surrounding 
the beam pipe. Here, the trajectories of charged particles and their transverse momenta are 
measured. The tracking detector can be subdivided into a silicon microvertex detector needed 
for precise primary and secondary vertex reconstruction and a larger- volume tracking chamber 
providing the lever arm to reconstruct the transverse momentum from the curvature of the 
track in a solenoidal magnetic field. The calorimeters are used to measure the energy and 
direction of electrons, photons, and hadronic jets. They are adapted to the different properties 
of both electromagnetic and hadronic showers. Finally, the calorimeters are surrounded by 
tracking detectors which serve to identify muons, which are the only charged particles that 
traverse the calorimeter without being absorbed. Schematic drawings of both CDF and DO 
are shown in Figure [7J Both experiments employ a three-layer trigger system that allows for 
an online selection of events for further analysis. All subdetectors, their readout electronics, 
and the trigger system are adapted to the Tevatron bunch crossing frequency of 1/(396 ns). 

As far as details of some of the subdetectors are concerned, CDF and DO differ significantly. 
However, the general functionality is very similar, and both experiments reconstruct charged 
leptons, hadronic jets, secondary decay vertices, and missing transverse energy which are then 
used to select tt candidate events and measure the top quark mass. The experiments use 
a coordinate system centered at the interaction point with the z axis along the beam pipe. 
Directions are expressed in terms of the azimuthal angle around the beam pipe and the 
pseudorapidity rj — — In (tan(0/2)), where 9 is the polar angle relative to the z axis. 

Of the integrated luminosity of more than 2 fb _1 delivered to each of CDF and DO, up 
to 1 fb" 1 has been used so far in top quark mass measurements. In comparison, Run I mea- 
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Figure 7: Schematic drawings of the CDF [S^j (top) and DO I35\f (bottom) detectors during 
Tevatron Run II. 
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surements were based on integrated luminosities of the order of 100 pb . A total integrated 
luminosity per experiment of 4 — 9 fb^ 1 is expected until the end of Run II of the Tevatron. 

4.2 Trigger Strategies 

Triggering ti event candidates that involve at least one leptonic W decay is relatively straight- 
forward because of the presence of an isolated electron or muon with large transverse energy 
or momentum. The presence of energetic jets can be used as an additional trigger criterion. 

To identify dilepton candidate events, both CDF and DO require the events to be triggered 
by the presence of a high- electron or high-p^ muon [36, 37J. While CDF requires one 
electron or muon, in the DO analysis two charged leptons in the first-level trigger and one or 
two (depending on the channel) charged leptons in the high-level triggers are required. 

In the £+jets event topology, CDF also relies exclusively on the charged lepton trigger [38] . 
The DO experiment requires a charged lepton and a jet, both with large transverse momentum 
or energy, to be found in the trigger [39] . 

Triggering ti events in the all-jets channel is more difficult because of the large QCD 
multijet background. The CDF analysis [10] uses a trigger that requires at least four jets and 
a minimum scalar sum of transverse energies, Ht, of at least 125 GeV. In the all-jets channel, 
the DO experiment has performed a measurement of the ti cross section [41], but not yet of 
the top quark mass. 

The characteristics of dilepton and £+jets ti events are distinctive, so typical trigger ef- 
ficiencies are around 90% or above (see for example [42J). In the all-jets channel, the CDF 
experiment quotes a trigger efficiency of 85% [43J. In general, the trigger requirements and 
therefore also the efficiencies vary as conditions are adjusted to changing instantaneous lu- 
minosity. The efficiencies are measured in the data as outlined in Section [5] as a function 
of the momenta of reconstructed particles (charged leptons, jets) in the event. The overall 
probability for a simulated event to pass the trigger conditions is obtained as the weighted 
average of the trigger efficiencies, taking into account the relative integrated luminosity for 
which each trigger condition was in use [44] . The trigger efficiency depends on the top quark 
mass, mainly because of the pt or Et cuts imposed in the trigger, and this effect must be 
taken into account in the mass measurement. 

4.3 Reconstruction and Selection of Top Quark Decay Products 

The offline reconstruction of the events selected by the trigger criteria aims at (1) further 
reducing the backgrounds and (2) reconstructing the momenta of the ti decay products as 
precisely as possible to obtain the maximum information on the top quark mass. In this 
section, the reconstruction and selection of isolated energetic charged leptons, of energetic 
jets, and of the missing transverse energy in ti event candidates are discussed. Also, the 
different possibilities for the identification of bottom-quark jets are described. 

4.3.1 Charged Lepton Selection 

Electrons are identified by a charged particle track pointing at an electromagnetic shower in 
the calorimeter. Additional criteria are then applied [321115]: Background from mis-identified 
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hadrons is reduced based on the ratio of the energy measured in the electromagnetic and 
hadronic calorimeter, the shower shape, and on the quality of the match between the calorime- 
ter shower and the charged particle track. CDF in addition vetos electrons from photon con- 
version processes. Non-isolated electrons, e.g. from semielectronic heavy-hadron decays in 
jets, are rejected by isolation criteria that impose a maximum calorimeter energy in a cone 
around the electron. 

Muons traverse the calorimeter and leave a track both in the central tracking chamber and 
in the muon chambers. The following criteria are applied to select muons from W decay in 
ti events [39| l4"5] : Background from mis-identified hadrons is reduced based on the distance 
between the central track extrapolated to the muon chambers and the muon chamber track. 
In addition, CDF requires the energy deposit in the calorimeter to be consistent with that of 
a minimum ionizing particle, and rejects muons with too large a distance of closest approach 
in the transverse plane, do, to the beam spot. Cosmic ray muons are rejected based on timing 
information. As for electrons, non-isolated muons, e.g. from semimuonic heavy-hadron decays 
in jets, are rejected by isolation criteria requiring a maximum calorimeter energy in a cone 
around the muon not to be exceeded. The DO experiment in addition imposes a similar 
isolation criterion based on the transverse momenta of tracks in a cone around the muon 
direction. 

Finally, a fiducial and kinematic selection is applied. To ensure reliable electron recon- 
struction in the calorimeter, electron candidates must be well within the central or forward 
calorimeters, excluding the overlap regions around \rj\ ~ 1. Some analyses exclude electrons 
in the forward calorimeter. The pseudorapidity range within which muons can be identified is 
limited by the acceptance of the tracking chamber. Typically, electrons (muons) are required 
to have a transverse energy (momentum) larger than a cut value between 15 and 25 GeV, 
depending on the analysis. Here, the calibrated energy and momentum values are used; the 
detector calibration is described in Section [5J 

4.3.2 Primary Vertex Reconstruction 

The position of the primary vertex is needed in order to compute the jet directions and to 
identify bottom quark jets using secondary vertex information. While the position of the hard 
interaction in the transverse plane ("beam spot") is well determined, the interaction region 
extends over tens of centimeters along the beam line. Tracking information is used to measure 
the z position of the primary vertex for each event. Since there may be multiple interactions 
per event, the vertex associated with the ti decay has to be identified. This is done based on 
reconstructed charged lepton information (CDF analyses involving charged leptons), or the 
vertex most consistent with the ti decay is selected among the candidates |I6J Hi] . 

4.3.3 Jet Reconstruction and Selection 

The final-state quarks in ti events are reconstructed as jets, using a cone algorithm (13 HH] 
with radius ATI = a/(At/) 2 + (A</>) 2 = 0.4 (CDF) or 0.5 (DO). The jet transverse energy is 
defined using the primary vertex position described in the previous section. The DO experiment 
applies cuts to select well-measured jets j3H], and both CDF and DO ensure that calorimeter 
energy deposited by electron candidates is not used in the jet reconstruction. A minimum 
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number of jets within a fiducial calorimeter volume of typically \rj\ < 2.0 (CDF, [33]) or 
\rj\ < 2.5 (DO, [SH]) and with a (calibrated) transverse energy above a cut value of typically 15 
or 20 GeV is required. The calibration of the calorimeter energy scale is discussed in Section [5j 

4.3.4 Missing Transverse Energy 

Neutrinos can only be identified indirectly by the imbalance of the event in the transverse 
plane. A feature of lepton+jets and dilepton tt events is thus significant missing transverse 
energy fix- The missing transverse momentum is reconstructed from the vector sum of all 
calorimeter objects, i.e. using finer granularity than the reconstructed jets and thus taking 
into account also small additional energy deposits [521133]. The missing transverse momentum 
vector is corrected for the energy scale of jets and for muons in the event. For the selection of 
lepton+jets events typically a missing transverse energy of fir > 20 GeV is required; the cut 
value for dilepton analyses is usually higher. 

The unclustered transverse energy E T ncl is defined as the magnitude of the vector sum of 
transverse energies of all calorimeter objects that are not assigned to a jet or charged lepton. 

4.3.5 Identification of Bottom Quark Jets 

A tt event contains two bottom quark jets, while jets in background events predominantly 
originate from light quarks or gluons. This is why the signal to background ratio is significantly 
enhanced after the requirement that at least one of the jets is 6-tagged. In addition, the number 
of relevant assignments of reconstructed jets to final-state quarks (jet-parton assignments) can 
be considerably reduced with 6-tagging information. 

Three different signatures can in principle be used to identify bottom-quark jets: 

• The presence of an explicitly reconstructed secondary vertex corresponding to the decay 
of the bottom-flavored hadron, 

• a low probability for all charged particle tracks in the jet to come from the primary 
event vertex (which again implies the existence of a displaced secondary decay vertex), 
or 

• the presence of a charged lepton within the jet from a semileptonic bottom or charm 
hadron decay. 

To date, for measurements of the top quark mass using b tagging, explicit secondary vertex 
reconstruction is used, which proceeds as follows [331136]. Tracks in the jet passing a pr cut are 
selected if they have significant impact parameter relative to the primary event vertex. CDF 
rejects poorly reconstructed tracks based on the hits and the track fit x 2 ; DO rejects tracks 
from K® and A decays and requires that the impact parameter of any track used in secondary 
vertex finding have a positive projection onto the jet axis (negative when determining the 
mistag efficiency, see below). Jets are called taggable if they contain at least two tracks that 
pass these criteria. These tracks are used to form secondary vertices; if a vertex is found 
with a large positive decay length significance L xy /a(L xy ) (> 3 for CDF and > 7 for DO) 
the jet is called 6-tagged. The distance L xy in the xy plane between primary and secondary 
vertex is multiplied by the sign of the cosine of the angle <fi between the vector pointing from 
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the primary to the secondary vertex and the jet momentum vector. While a large positive 
value of L xy is a sign for a decay of a long-lived particle, the distribution of negative values 
contains information about the L xy resolution. Jets tagged with negative L xy are used in 
the determination of the mistag efficiency, i.e. the efficiency with which non-6 quark jets are 
erroneously tagged, see Section 15.31 



4.4 Backgrounds and tt Event Selection 

Two types of background have to be distinguished: (1) physics background where all final- 
state particles are produced but in a different reaction; generally these processes will not 
involve top quarks, but misassignment of top quark events to the wrong event topology also 
has to be taken into account; and (2) instrumental background, where part of the event 
is mis-reconstructed. At a hadron collider, instrumental background mainly involves jets 
that lead to wrongly identified isolated leptons. Together with the backgrounds, a general 
outline of the event selection for the different tt topologies is given below; concrete examples 
of event selection criteria are described more fully later together with the top quark mass 
measurements. 



4.4.1 Dilepton Events 

Physics background in the dilepton channel arises from all processes leading to a final state 
with two charged leptons of opposite charge and two jets. For the ee and /i/z channels, the 
largest background is from Drell-Yan events containing two additional jets. These events 
can be efficiently removed by requiring a minimum charged lepton px (to remove low-mass 
resonances), inconsistency of the dilepton invariant mass with the Z mass, and significant 
missing transverse energy. For all dilepton channels, Z/'y* — > tt events with two leptonic r 
decays as well as diboson events (the WW cross section is largest, but WZ events also have to 
be taken into account) with leptonic W decay remain. For the dilepton channels as well as the 
other channels, misidentification of ti events containing tauonic W decays with subsequent 
leptonic r decay has to be accounted for. 

Instrumental background in the dilepton channel arises mainly from events with one lep- 
tonic W decay and three jets, one of which is mis-identified as another lepton. Jets can appear 
as isolated electrons if they contain a leading n° — > 77 decay, resulting in large electromag- 
netic energy deposition in the calorimeter, possibly with a track pointing at it from conversion 
(7 — > e + e~) of one of the photons, and only little surrounding jet activity. Additional contri- 
butions come from semileptonic bottom or charm hadron decays within jets. 

Leptons from r decays and jets not from top quark decay have mostly small transverse 
energies. To select ti dilepton event candidates, the experiments thus typically require two 
charged leptons of opposite charge with large Ex and spatially isolated from jet activity, two 
large--&r jets, and significant missing transverse energy. Most of the remaining background 
can be removed by requiring jets to be 6-tagged; however, this is often not desirable for small 
data samples. 
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4.4.2 Lepton+Jets Events 

Leptonic W decays produced in association with jets, which lead to instrumental background 
for dilepton events, are the main physics background for tt events in the £+jets channel. 
Another physics background is from electroweak single top production with additional jets. 
Diboson events contribute when in contrast to above, one leptonic W decay occurs together 
with another hadronic weak boson decay. Background from events with a leptonic Z decay 
can be removed by rejecting events with more than one isolated energetic charged lepton. 
Similarly, background from Zj 7* — > tt events arises if one r decays leptonically and the other 
hadronically. 

Instrumental background in the £+jets channel is due to QCD multijet events with at least 
five jets, one of which is mis-identified as a lepton as described above. 

Lepton+jets tt events are selected by requiring one isolated charged lepton with large 
Et-, normally four large- Et jets at least one of which is 6-tagged (both requirements can be 
relaxed), and significant missing transverse energy. 

4.4.3 All-Jets Events 

The overwhelming background in the all-jets channel is from QCD multijet events that contain 
six or more reconstructed jets. Most of this background does not contain b jets, and the 
kinematic properties of the jets differ slightly from those of jets in signal events. The selection 
relies on a combination of b tagging and kinematic criteria. Since the QCD multijet process 
cannot be reliably simulated and the total background has to be estimated from the data, 
there is no need to explicitly account for individual subdominant background processes. 

4.5 Jet-Parton Assignment 

In most analyses, in particular those based on explicit top quark mass reconstruction, the 
reconstructed jets need to be assigned to the final-state quarks from the tt decay to measure 
the top quark mass. Depending on the tt topology, different numbers of possible jet-parton 
assignments have to be considered; for all-jets events, 90 different assignments have to be 
distinguished. In £+jets and all-jets events, the number of relevant assignments can be reduced 
when 6-tagged jets are present, which are likely to be direct top quark decay products. 

A further complication arises when additional jets are present in the event. Since jets 
from initial-state radiation, from the underlying event (interactions involving the proton or 
antiproton remnant), or from additional hard interactions in the same beam crossing typically 
have small transverse energy Et, many analyses consider the n highest-Er jets as tt decay 
products, where n = 2, 4, 6 in the dilepton, £+jets, and all-jets topologies, respectively. 

The issue of jet-parton assignment is further discussed in Sections [7J El and M together 
with the individual analyses. 

4.6 Simulation 

Monte Carlo simulated events are used for several purposes in the analyses: 

• to compare measured and simulated distributions in order to check the detector; 
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• to determine the detector resolution; 

• to optimize the selection and determine the fraction of signal events in the selected data 
sample; 

• to calibrate the methods for measuring the top quark mass; and 

• to compare the top quark mass uncertainty obtained in the data with the value expected 
for the measured fraction of signal events. 

Simulation programs are based on the factorization scheme (cf. Section [37X7) . and in general, 
separate program libraries can be used to model the hard interaction, additional gluon and 
photon radiation in the initial and final state, the parton distribution functions, hadroniza- 
tion, decays of unstable particles, and the detector response. Interference between different 
processes populating the same experimental final state is usually neg lectecfl. This is a good 
approximation since the final-state color, flavor, and spin configurations are in general dif- 
ferent: For example, £+jets ti production can only interfere with those W^+jets events that 
contain a bb pair and two additional quarks (but no hard gluons) in the final state. 

The simulation used so far in the Tevatron analyses is based on leading-order matrix 
elements to describe the hard process. The Monte Carlo generators pythia [TS], herwig 
or alpgen [20J are used to generate the hard parton-scattering process in ti events and 
background events involving weak vector bosons (PF+jets events; WW, WZ, and ZZ events; 
single top production; and Drell-Yan events in association with jets). These generators are 
interfaced to leading-order parton distribution functions, in general CTEQ5L [22]. Leading- 
order calculations of total cross sections have large uncertainties, and where possible, absolute 
production rates are scaled to accommodate the data, so that only the prediction of relative 
cross sections is taken from the simulation. 

The simulation of the hard-scattering process is interfaced with pythia or herwig to 
simulate initial- and final-state gluon radiation. Matching procedures have been developed to 
ensure that the phase space regions covered by hard gluon radiation and by gluon emission 
included in the matrix element calculations do not overlap, pythia or herwig are also used 
to model fragmentation and hadronization, and are interfaced with evtgen [50] or QQ [51] 
and tauola [52] to simulate heavy hadron and tau lepton decays. The simulated events 
are passed through a detailed simulation of the detector response based on geant [53] and 
are then subjected to the same reconstruction and selection criteria as the data. A detailed 
general discussion of the event simulation process can be found in [51] , and a list of programs 
used for top quark measurements is given in [55] . 

Depending on the instantaneous luminosity, it is possible that more than one pp or pp 
collision takes place in one bunch crossing. To simulate this effect, minimum bias events 
(events with only very loose trigger requirements) are recorded and superimposed on the 
simulated events. Similarly, pileup of signals from collisions in subsequent bunch crossings is 
simulated by overlaying events recorded with a random trigger. 

Background not involving any leptons from vector boson decay (QCD multijet background) 
is not modeled using Monte Carlo simulation, but estimated from the data using events with 
non-isolated leptons [1U [5B] and/or little Tfs? [57J. An exception is one CDF analysis in the 
all-jets final state, where alpgen is used to model the multijet background [58J. 



2 An exception are Drell-Yan events, where interference between photon and Z exchange is included. 
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The reconstructed energies and momenta in the simulation are smeared such that the de- 
tector resolution agrees with that of the actual data. The modeling of kinematic distributions 
in the simulation is then checked. Signal events are generated for various assumed top quark 
masses in order to calibrate the measurement methods. 
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5 Detector Calibration 

To measure the top quark mass it is not sufficient to merely select ti event candidates. 
An accurate understanding of how the detector responds to the decay products in tt 
events is also indispensable. It is only this second step that allows to relate the 
properties of the events to the value of the top quark mass. The procedures with 
which the experiments calibrate the detector response are outlined in this section. 

An accurate calibration of the energy/momentum scale and resolution for the reconstructed 
particles used to measure the top quark mass is crucial. Also, even though the measured top 
quark mass does not directly depend on the absolute detector efficiency, the dependence of 
the efficiency on particle energies/ moment a and pseudorapidities must be known, too. In this 
section, the calibration procedures used by the Tevatron experiments are introduced. It is 
worth noting that usually a large fraction of the analysis work needed in a top quark mass 
measurement is related to detector calibration. 

5.1 Charged Leptons 

The reconstruction of electrons and muons can be calibrated using Z — > e + e~ and Z — > fi + fi~ 
decays. In addition, information from W — > ev events, cosmic ray muons, and cc and bb 
resonance decays can be used. These events have the advantage that they can be identified 
with low backgrounds, and that the measurement of one particle or by one detector system 
can be cross-checked with another. The electromagnetic calorimeter yields the most precise 
measurement of the energy of energetic electrons, while the central tracking chamber is used 
to measure the muon (transverse) momentum. 

The transverse momentum scale for energetic muons is adjusted such that the recon- 
structed Z mass reproduces the known value. Additional information on the momentum scale 
is obtained from the lower-mass resonance decays J/ip — > [i + and T(IS') — > The 
reconstructed invariant mass distribution of Z — > decays obtained with the CDF ex- 

periment is shown in Figure E^a) [55] • An example of further studies of the momentum scale 
is given in Figure M [60J , which shows the Z — > ji + \i~ mass distribution for DO data in events 
where (1) both muons are isolated and (2) one muon fails the isolation criteria, indicating 
the presence of Bremsstrahlung. The energy scale for energetic electrons is set with the 
reconstructed Z — > e + e~ invariant mass distribution. The distribution obtained by the CDF 
experiment is shown in Figure [S](b). Additional input is obtained from a comparison of re- 
constructed electron energy and track momentum in W — > ev decays as discussed below. The 
resulting uncertainties in the calibration of the absolute muon momentum and electron energy 
scales are negligible for top quark mass measurements (compared with the jet energy scale 
uncertainties, see below). 

The energy /momentum resolution can be studied using Z — > e + e" and Z — > events, 
too. Also, a cosmic ray muon traversing the center of the detector is reconstructed as two 
muons, and the distribution of the difference between the two reconstructed momenta yields 
additional information on the momentum resolution. Similarly, since the calorimeter energy 
measurement is more precise at high energies than the track momentum, a comparison between 
the two quantities in clean samples of isolated electrons can be made to cross-check the track 
momentum resolution. Figure [TD] shows the results of these studies with CDF data [6IJ. 
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Figure 9: Distribution of the invariant / u + mass for selected Z — > [i + ^ events in the DO 
data for isolated muons (histogram) and events where at least one muon does not pass all 
isolation cuts (points with error bars, scaled to the same number of entries) [60]. 
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Figure 10: Distribution of the difference in curvature for the two tracks in a CDF cosmic ray 
event (a), yielding a measurement of the momentum resolution. Distribution of the energy 
divided by momentum inW — )■ ev events at CDF (b) together with a Gaussian fit in the range 
0.8 < E/p < 1.08. Electrons with with significant Bremsstrahlung at large E/p are excluded 
from the fit; their abundance is a measure of the amount of detector material. 



The efficiency to reconstruct an electron or muon can be factorized into several contri- 
butions: trigger efficiency, tracking efficiency, the efficiency to identify the track as electron 
or muon, and the efficiency of further criteria like isolation cuts. All individual efficien- 
cies are measured in the data using Z — y £ + £~ events (CDF determines the tracking effi- 
ciency with W —y ev candidate events using calorimeter-only selection criteria), see for exam- 
ple [601 EH E2]. The concept of the tag-and-probe method in Z — y £ + £~ events is visualized in 
Figure ITT1 A clean sample of Z — y £ + £~ events is obtained using a selection where the criterion 
under investigation is not applied to one of the leptons. The fraction of selected events where 
this lepton also passes the additional criterion is then a measure of the efficiency. 

The calibration of top quark mass measurements relies heavily on the quality of the detector 
simulation. The simulation is tuned (and an additional scaling and smearing is applied where 
necessary) to reproduce the position and width of the Z — y £ + £~ invariant mass peak. This 
can for example become necessary when the description of the detector material or alignment 
in the simulation does not fully reproduce reality. Also, the efficiency in the simulation may 
have to be scaled. 



5.2 Hadronic Jets 

For the same reasons as outlined above in the section about charged leptons, it is crucial to 
have a precise knowledge of the jet energy scale and resolution, and to accurately reproduce 
them in the simulation. In most analyses, the measurement of the top quark mass relies to 
a large extent on the reconstructed jet energies. However, the energy scale and resolution of 
jets is more difficult to determine experimentally than that of charged leptons. Therefore, 
current top quark mass measurements are systematically dominated by the knowledge of the 
absolute jet energy scale [9], and for a given sample and analysis technique the statistical 
error on the top quark mass is dominated by the jet energy resolution (see below for a more 



26 



5 DETECTOR CALIBRATION 



"TAG -ii" 




muon identified 
in the trigger , 
and offline 



"TAG -n" 



z central track 
ix p T >30 GeV 




✓ central track \ 
jxp T > 30 GeV 



isolatei 



isolated / 



track ?? , 




"PROBE -n." 

p T > 15 GeV 



\ central track " 
\p T >20 GeV 



- - muon identified 

isolated _ j n the trigger? 
- offline? 

"PROBE -u" 



Figure 11: Schematic illustration of the tag-and-probe method to measure the tracking ef- 
ficiency (left) and the efficiency of the charged lepton identification in the trigger and off- 
line |Sf . 



detailed discussion). 

In the following, the determination of the jet energy scale, the relevance of the jet energy 
resolution, and the agreement between data and Monte Carlo simulation are discussed. 

5.2.1 Overall Jet Energy Scale 

For the determination of the top quark mass, the momentum vectors of the quarks in the 
final state are needed. However, the detectors measure particle jets, and their directions and 
energies are taken as a measure of the quark momentum. While the direction of the initial 
quark is quite well reproduced by the jet direction, the correspondence between jet and quark 
energies is more involved. This correspondence is established in two steps: 

1. First, the energy of the measured jet is related to the true energy of the particle jet. This 
step depends on detector effects and on the jet algorithm used. 

2. Second, the quark energy is inferred from the particle jet energy. This second step only 
involves the effects of fragmentation and hadronization and is thus independent of the 
experimental setup. Depending on the analysis, this relation can be established via 
Monte Carlo models or via a parametrization with transfer functions. 

In this section, the correction procedures applied by the two Tevatron experiments to obtain 
particle jet energies are outlined; for details, see [63, 64J. The transition to quark energies 
is regarded as part of each specific top quark mass measurement and is described later in 
Sections together with the individual analyses. 

The transition from measured to true particle jet energies requires several corrections: 

• Energy Offset Eq- Before corrections are made, the energy scale for the electro- 
magnetic calorimeter is set such that the Z — > e + e~ peak is correctly reproduced, as 
described in Section 15.11 Contributions from detector noise, energy pile-up from pre- 
vious bunch crossings, additional interactions in the same bunch crossing ("multiple 
interactions"), and the underlying event, i.e. reactions of partons in the proton and an- 
tiproton other than those that initiated the pp — > ti process, are then subtracted from 
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the measured jet energy. The correction for this energy offset Eq depends on the jet 
algorithm and parameters (e.g. the cone size), the pseudorapidity, and the instantaneous 
luminosity. The DO experiment determines it from energy densities in minimum bias 
events. 

Calorimeter Response R: The second correction concerns the calorimeter response. 
There is no straightforward way to determine the response with a resonance similar to 
the procedure applied for electrons and muons based on leptonic Z decays as described 
in Section EIU because hadronic decays of single W or Z bosons cannot be distinguished 
experimentally from QCD dijet events. (An exception are hadronic W decays in ti 
events, which are discussed below.) 

The response to hadronic jets can therefore only be measured with events where a 
jet is balanced by another object for which the detector response is known. The DO 
experiment uses 7+jet events, taking the photon energy scale from Z — > e + e" events. 
In these events, the so-called missing Et projection fraction method allows to measure 
the calorimeter response from the pt imbalance [64] : For an ideal detector, the photon 
transverse momentum p T and the transverse momentum of the hadronic recoil p T ad are 
expected to be balanced. However, before calibration of the calorimeter response an 
overall transverse momentum imbalance fx 7^ may be observed: 

Wp^ + i? ha VT ad = -Pt ■ (6) 

The missing transverse momentum vector is corrected for the electromagnetic calorime- 
ter response TV determined from Z — > e + e~ events. After that, the hadronic response 
is obtained as 

^had = 1 Pt Pt (7) 

(Plf 

In events with one photon and exactly one jet, the jet response can be identified with 
the hadronic response i? had . The calorimeter response depends on the jet energy and 
pseudorapidity; in particular, the response for jets in the overlap regions between the 
central and endcap calorimeters at \if ct \ w 1 is different from that for jets fully con- 
tained in one of the calorimeters. These effects are taken into account by measuring 
the response as a function of both pseudorapidity and estimated jet energy. Since the 
energy resolution for jets is broad, the true jet energy in 7+jet events is estimated from 
the photon transverse energy E T and jet pseudorapidity rf et as 

E' = E T cosh(77 jet ) . (8) 



The CDF experiment first measures the dependency of the response on the position 
in the detector (as for DO, the response is not expected to be uniform because of gaps 
between the individual parts of the calorimeter and because of their different responses); 
after applying these 7] dependent corrections the absolute jet energy scale is determined 
from a Monte Carlo simulation of the detector and cross-checked with results of the 
missing Et projection fraction method described above |63J . The simulation is tuned to 
model the response to single particles by comparing the calorimeter energy and track 
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Figure 12: The CDF experiment uses dijet events with a trigger jet within 0.2 < \t)\ < 0.6 to 
obtain rj dependent corrections to the jet energies. Shown is the ratio of the second (probe) jet 
Pt and the trigger jet p T as a function of probe jet pseudorapidity for various average jet px 
regions and for data and Herwig and Pythia simulated events as explained in the figure Wc 



momentum measurements for single tracks, using both test beam data and CDF data 
taken during Tevatron Run II. Because of the limited tracking in the forward regions, 
this procedure is used for the central calorimeter only, and the forward calorimeter 
response is determined relative to the one for the central calorimeter. The r\ dependent 
corrections are obtained by balancing dijet events and are shown in Figure [12j Because 
the simulation only describes the data well for values of \r)\ up to about 1.4, separate 
corrections are derived for data and pythia Monte Carlo simulation; herwig events are 
not used because of the large discrepancies for \r]\ > 1.4 and px < 55 GeV. An indirect 
determination of the response for jets, inferred from the momenta of the tracks within 
the jet, is shown in Figure [J3j 



• Showering Correction S: The first two corrections are specific to each experiment 
and yield jet energies that are independent of the experimental setup, but still depend 
on the jet finding algorithm. In general, not all energy deposits belonging to the jet are 
assigned to it by the jet algorithm, and thus a fraction of the energy is not accounted 
for in the measured jet energy. The energy fraction assigned to the jet is a function of 
the jet algorithm and its parameters, the jet energy itself, and the pseudorapidity. 
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Figure 13: The response for jets in the CDF experiment as a function of the jet transverse mo- 
mentum, for data as well as simulated events. The response is determined indirectly from the 
track momenta, and the deviation of the jet response values from 1.0 is due to the calorimeter 
response to hadrons being smaller than unity [63]. 



The particle jet energy E cori is thus obtained from the raw measured energy E meas as 

E corr = E™"(a)-E (g,C,r,) 

R(a,E meas } 7]) S (a,E meas ,ri) ' ^ ' 

where Eo, R, and S are the three corrections described above, depending on the jet algorithm 
and its parameters, denoted by a, the instantaneous luminosity £, the pseudorapidity 77, and 
the jet energy itself. An example of the different contributions to the uncertainty on the jet 
energy scale is given in Figure [HI for the CDF experiment. This uncertainty on the overall 
energy scale for jets leads to the dominant systematic error on the top quark mass unless the 
scale is determined simultaneously with the top quark mass from the same events. 

Hadronic W decays in tt events provide a means of calibrating the energy scale for light- 
quark jets with the same event sample for which the calibration is needed to measure the top 
quark mass. Such an in situ calibration is very attractive experimentally since one becomes 
independent of uncertainties due to e.g. the photon selection, the jet flavor composition of 
7+jet events, or Monte Carlo simulation. However, at the Tevatron the size of the tt event 
samples is not sufficient to calibrate the jet energy scale as a function of pseudorapidity and 
energy. Therefore, all jet energy corrections described above are still applied, and in situ 
calibration is then used only to determine the overall energy scale for all jets. With this 
approach, the largest part of the jet energy scale systematic error on the top quark mass 
can still be absorbed in an increased statistical uncertainty. If desired, the information on 
the overall scale parameter from 7+jet events or Monte Carlo calibration can be used as 
an external prior to further reduce the uncertainty. Analyses using in situ calibration are 
presented in detail in Sections 17.11 El and 

5.2.2 Bottom-Quark Jet Energy Scale 

Even for a given momentum of the parton initiating a jet, both the frequency with which the 
various hadron species are produced and their momentum spectra are different for quark jets 
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of different flavor or gluon jets. The experiments in general distinguish between bottom-quark 
and light-flavor jets, where the latter includes any jet that is not initiated by a bottom quark. 

Because the particle momentum spectrum differs, the ratio of electromagnetic to hadronic 
energy is different, thus leading to a different response for bottom-quark and light-flavor jets. 
A further correction is necessary for jets containing neutrinos which are not measured at 
all, and for muons which only deposit a small fraction of their energy in the calorimeter. 
This correction is relevant for jets containing semileptonic heavy hadron decays. An explicit 
correction can be applied for jets in which the charged lepton is identified inside the jet (only 
muons are used at the moment). The response for bottom-quark jets without an identified 
muon will still be shifted due to unidentified semimuonic and semielectronic heavy hadron 
decays. The showering correction will in principle be different for bottom- and light-flavor 
jets as well, due to the mass of the decaying bottom hadron. 

In practice, the full jet energy corrections are derived as described above for light-flavor 
jets (for example, most 7+jet events will not contain bottom-quark jets). For bottom-quark 
jets, additional corrections are applied to this jet energy scale, and systematic uncertainties 
are quoted both for the overall (light-flavor) jet energy scale and for the relative scale for 
bottom-quark and light-flavor jets; for details, see for example [SSI EH]- 

5.2.3 Jet Energy Scale Corrections Specific to ti Events 

In addition to the general corrections described so far, the CDF experiment applies specific 
corrections to the energy scale of jets in ti events. These corrections account for the px spectra 
and jet flavors encountered in ti events, which are different from those of the events for which 
the general corrections have been derived (the jets in ti events are initiated by quarks, two of 
which are bottom quarks, and one charm-quark jet is expected in every second hadronic W 
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decay, while gluon jets are only expected if additional radiation occurs). The corrections are 
derived from simulated events as described in [SB]- Light- and bottom-quark jets are corrected 
differently, and thus this last correction can only be applied once a jet is assigned to a final 
parton. In contrast, the DO experiment absorbs these corrections into the transfer functions 
used in the Matrix Element and Ideogram analyses. For corrections that are identical for 
both data and simulation, the measured top quark mass is not systematically shifted since 
the measurement calibration is based on the simulation. The correction may however lead 
to an improvement of the statistical sensitivity in template-based measurements where no m t 
dependent likelihood is derived on an event-by-event basis. 

5.2.4 Relative Jet Energy Scale Between Data and Simulation 

In all top quark mass measurements, simulated events are used to calibrate the measurement 
technique. Thus, if the corrected jet energies in the data systematically do not reproduce the 
particle jet energies, and the same effect is present in Monte Carlo simulated events, the cali- 
bration procedure assures that the top quark mass is still measured correctly. Consequently, 
only uncertainties on the relative data/Monte Carlo jet energy scale enter the systematic error 
on the top quark mass. 

5.2.5 Jet Energy Resolution 

For a given event sample and analysis technique the statistical uncertainty on the top quark 
mass is dominated by the jet energy resolution. To illustrate this, events with a top quark 
involving a leptonically decaying W have been passed through the full simulation of the 
DO detector, and the effect of the detector resolution on the reconstructed top quark mass 
distribution is studied. Of the three top quark decay products, either for the bottom quark or 
the charged lepton the reconstructed momentum vector is taken, while the true momentum 
vectors are used for the other two decay products. The results in the left plot of Figure [TBI 
show that the inclusion of the jet resolution has the largest effect on the distribution of the 
reconstructed top quark mass. The tails visible when using the reconstructed muon momentum 
are due to the fact that the momentum resolution degrades with increasing px] the effect on 
the reconstructed top quark mass distribution is demonstrated in the right plot of Figure [15j 

The CDF experiment has tuned the simulation so that not only the mean shower energy 
in single track data is reproduced (which is relevant for the overall jet energy scale, see 
Section I5.2.ip . but also the parameters describing the shower shape [63J. Consequently, the 
jet energy resolution is taken from the simulation. 

At the DO experiment, the jet energy resolution is measured from 7+jet (below a jet Et 
of 50 GeV) and dijet events (above 50 GeV). The same measurement is performed in data 
and Monte Carlo, and the resolution of simulated jets is smeared to reproduce the data. The 
measured top quark mass depends on the modeling of the jet resolution because the event 
selection in general requires a minimum jet Et- Furthermore, an accurate modeling of the 
resolution allows the observed statistical error to be compared with expectations from the 
simulation. 
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Figure 15: Simulated top quark mass distributions for top quark decays at m t = 175 GeV 
involving a leptonic W decay, taking the momentum as reconstructed with the DO detector for 
(a) the electron, (b) the muon, or (c) the b jet, and the true momenta for the other two decay 
products. The effect for muons of different true transverse momentum is shown separately in 
plots (bl) to (b6). The mean and width of the top quark mass distribution are determined with 
a Gaussian fit and given in units of GeV. 
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5.3 Efficiency of Bottom-Quark Jet Identification 

When the efficiency to identify 6-quark jets is defined for taggable jets (cf. the definition 
of taggability in Section 14.3.51) . it becomes independent from detector inefficiencies. The 
taggability is related to the efficiency with which tracks are reconstructed. To take into account 
the geometrical acceptance of the silicon detector, the DO experiment measures the taggability 
of jets in bins of the quantity \z pv \ x sign (z pv r]i et } where z pv and if et are the z position of the 
primary vertex and the pseudorapidity of the jet, respectively [14] . The measured taggability 
is parametrized as a function of jet transverse momentum and pseudorapidity, and the relative 
taggabilities of light, charm, and bottom jets are determined from the simulation. The results 
of the study are shown in Figure [16j 

Both CDF and DO determine the b-tagging efficiency for bottom-quark jets and the mistag 
rate for light-flavor jets from data, with additional corrections based on the simulation [14], |4"B]. 
The efficiency for bottom-quark jets is measured on a dijet event sample whose bottom-quark 
content is enhanced by requiring the presence of an electron (CDF) or a muon (DO) within 
one of the jets as an indication of a semileptonic heavy hadron decay. 

The CDF experiment determines the bottom-quark content of their calibration sample by 
reconstructing D° — > K~tt + decays or muons in the jet containing the electron, both of which 
are additional signatures for a heavy hadron decay. The DO experiment uses the transverse 
momentum spectrum of the muon relative to the axis of its jet to measure the bottom-quark 
content of the calibration sample. 

Both CDF and DO thus measure the fe-tagging efficiency for bottom-quark jets with a 
semileptonic decay. Corrections to obtain the efficiency for inclusive bottom-quark jets are 
derived from the simulation. The CDF experiment also takes the dependence of the 6-tagging 
efficiency on jet energy, pseudorapidity, and track multiplicity from the simulation, while the 
overall normalization is determined from the data measurement. 

Both CDF and DO measure the light-flavor tagging rate (mistag rate) on the data using the 
rate of jets that contain a secondary vertex with negative decay length significance L xy /a(L xy ) 
(cf. Section f4.3.5l) . After correction for the contribution of heavy-flavor jets to such tags and 
the presence of long-lived particles in light-flavor jets, this rate is a measure of the probability 
that a light-flavor jet gives a secondary vertex tag with positive L xy /a(L xy ). The 6-tagging 
efficiency for charm-quark jets cannot easily be determined from data, and thus the ratio of 
efficiencies for charm- and bottom-quark jets is taken from the simulation. 

The 6-tagging efficiencies of the CDF and DO secondary vertex tagging algorithms for 
taggable jets are shown in Figure [T71 
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Figure 16: Jet taggability measurements by the DO experiment JJ$ . (a): Definition of tagga- 
bility regions, (b)-(e): Taggability measured in the data as a function of jet pt ((b), (d)) and 
jet \r]\ ((c), (e)) for jets in the various regions defined in (a), (f), (g): Taggability ratios of b 
to light (full circles) and c to light (open squares) jets determined from simulated events. 
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Figure 17: Secondary vertex b-tagging efficiencies for taggable jets, (a): CDF experiment 
Efficiency for bottom- quark jets with \q\ < 1 as a function of jet Et together with the ±ler 
uncertainty range, (b)-(e): DO experiment /^/. Measured efficiency for bottom- quark jets as 
a function of (b) jet p T and (c) jet \r}\ and for charm-quark jets as a function of (d) jet pt 
and (e) jet \n\; the ±lcr uncertainties of the fits are indicated by the red dotted lines. 
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6 Methods for Top Quark Mass Measurements 

So far, the report described the selection of top-quark events and the calibration 
of the detectors. This section now introduces a classification of the methods to 
determine the top quark mass from the events selected. Following this classification, 
the subsequent sections then give details about each of the methods together with 
concrete examples. 

Experimental results for the top quark mass can be grouped according to the ti decay 
channel analysed, cf. Section 13.21 A comparison between the measurements in different chan- 
nels allows to search for an indication of differences and thus for new physics effects beyond 
the Standard Model. 

In this section, another classification is introduced according to the measurement technique 
applied. Even though top quarks decay before hadronization, the information on the top 
quark mass is still diluted in the events measured in the detector by physics effects (initial- 
and final-state radiation and hadronization) and the detector resolution. In broad terms, the 
following different approaches have been followed by the Tevatron experiments to deal with 
this complication: 

Template Method: A measurement quantity per event called estimator (mostly a single 
number, but in some measurements a vector of numbers) that is correlated with the top quark 
mass is computed per event. Any measured quantity in the event that is correlated with the 
mass of the decaying top quark can be used as estimator in the analysis. In all cases, it is 
mandatory to understand the exact top quark mass dependence of the distribution of this 
quantity. 

In lepton+jets and all-jets events where enough decay products are reconstructed, the 
smallest statistical error is obtained if the invariant masses of the two top quarks are explicitly 
reconstructed to obtain the estimator; this is however not necessary (and not possible in 
dilepton events without additional assumptions because of the two neutrinos in the final 
state). In addition, alternative techniques have been developed for lepton+jets events to 
reduce the sensitivity to systematic errors by a careful selection of the estimator. These are 
already being explored at the Tevatron and will become much more important at the LHC 

The distribution of the estimator for the set of selected data events is compared with the 
expected distribution for various assumed values of the top quark mass. This so-called template 
distribution is generated using simulated signal and background events, taking efficiencies and 
the relevant cross sections into account. The values of the estimator in the data events are 
then compared to the template distributions in a fit to determine the top quark mass. To 
increase the statistical power of the method, the event sample is often divided into subsamples 
with different signal purity, for example according to the number of 6-tagged jets per event. 

Matrix Element Method: For each selected event, the likelihood to observe it is calculated 
as a function of the assumed top quark mass. To this end, all possible reactions yielding final 
states that could have led to the observed event are considered. An integration is performed 
over all possible momentum configurations of the final state particles for all relevant reactions. 
In this integration, the probability of the colliding partons to have a given momentum fraction 
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of the proton or antiproton is taken into account using the appropriate PDFs. Similarly, the 
likelihood to obtain the detector measurement for an assumed final state is accounted for by a 
transfer function that relates an assumed final-state momentum configuration to the measured 
quantities in the detector. Here, a choice can be made which detector measurements to use in 
the analysis; for example, the measured missing transverse momentum is not explicitly used 
in the Matrix Element measurements in the lepton+jets channel. 

The Matrix Element method accounts for the fact that the accuracy of the information on 
the top quark mass contained in different events is in general different: 

• Depending on the event kinematics and characteristics like the quality of a b tag, some 
selected events have a higher likelihood of being a ti event for a certain top quark mass 
than others. 

• Depending on the energies and directions of the final-state particles, the resolutions of 
the measured momenta and thus of the top quark mass generally differ between events. 

Because a likelihood as a function of assumed top quark mass is calculated separately for each 
event, and the likelihood for the entire event sample is obtained as the product of the individual 
event likelihoods, each event contributes to the measurement with its appropriate weight, and 
the Matrix Element method minimizes the statistical uncertainty of the measurement. 

However, the integration over final-state momenta is complex, and it is impossible in prac- 
tice to use full detector simulation to evaluate the transfer function during the integration. 
Simplifying assumptions are thus made in the integration, and the measurement is then cali- 
brated using fully simulated events. Still, a Matrix Element measurement requires significantly 
more computation time than a template analysis. The Matrix Element method was first used 
by DO for Tevatron Run I data [52], where it yielded the single most precise measurement, 
and it also currently yields the single most precise measurement at Run II [66J. 

Ideogram Method: The Ideogram method can be regarded as an approximation to the 
Matrix Element method. It does not make use of the full kinematic characteristics of each 
selected event, but only relies on information about the invariant masses of the top and antitop 
quarks and W bosons. The description of wrong jet-parton assignments and of background 
events is even further simplified. 

The statistical sensitivity is not substantially reduced relative to that of the Matrix Element 
method if the signal to background ratio is large. In particular, the method retains the benefits 
of using a per-event likelihood as a function of assumed top quark mass. The computations 
are however simpler than for the Matrix Element method, making the Ideogram method a 
candidate for future analysis of large-statistics data samples. 

If the measured jet energies are used in the computation of the estimator or in the transfer 
function in order to minimize the statistical uncertainty, the dominant systematic uncertainty 
is due to the hadronic jet energy scale. The dependence of the top quark mass on external 
jet energy scale measurements and thus the associated systematic uncertainty is significantly 
reduced if an overall jet energy scale factor can be fitted simultaneously with the top quark 
mass using the same ti events. The information on the jet energy scale comes mainly from 
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hadronic W — > qq' decays where a constraint to the knowrjfl W mass can be used. This in situ 
calibration also reduces the systematic correlation between measurements on different event 
samples, which further improves the combination of Tevatron results. 

In Section[7]a description of the estimators used in the various template measurements at the 
Tevatron is given. Section [8] gives details about the computation of the mass-dependent event 
likelihood in the measurements using the Matrix Element method, and Section [9] outlines the 
analyses using the Ideogram method. The final top quark mass determination from an event 
sample is described in Section [TUJ and systematic uncertainties are discussed in Section [TTJ 
Since there is no no a priori fundamental difference between the CDF and DO experiments on 
the level of reconstructed tt decay products, the following sections describe the measurement 
techniques based on individual analyses as examples. A full account of all results is given in 
Section [T2l 



3 The W mass is known from LEP and the Tevatron j3] with a precision far beyond what is needed for the 
determination of the hadronic energy scale. 
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7 The Template Measurement Method 

The template method is the "standard" technique used at the Tevatron to determine 
the top quark mass from the selected ti events. This section introduces the key 
concept of the template method, the per-event estimator of the top quark mass. Based 
on recent Tevatron measurements as examples, the calculation of estimators for the 
various tt event topologies is described. 

In this section, template-based measurements of the top quark mass are reviewed. The gen- 
eral description of the analyses is complemented with concrete examples from recent Tevatron 
measurements. 

Measurements using the template method are based on the determination of an estimator 
for each event, which is a quantity that captures the information about the top quark mass 
contained in the event. Depending on the event topology analyzed, but also depending on 
the relative importance of statistical and systematic errors, various choices of estimators are 
possible. In addition, a second estimator is introduced when the jet energy scale is measured 
simultaneously with the top quark mass. The different choices of estimators used in Tevatron 
analyses and their computation are described in this section. 

The values of the estimator for the selected data events are compared with the expected 
distribution as a function of the true top quark mass. These expected estimator distributions 
(templates) are generated using simulated signal and background events for a discrete number 
of true values of the top quark mass. Trigger and selection efficiencies and the appropriate 
cross sections (as a function of the top quark mass) are taken into account. The comparison 
of the sum of signal and background templates for various top quark mass hypotheses with 
the observed estimator distribution in the data then yields the likelihood to observe this event 
sample as a function of assumed top quark mass. The top quark mass is usually extracted in 
a fit of the measured events to the generated template distributions, for which a continuous 
parametrization of the templates as a function of true top quark mass is obtained. This 
parametrization is also described in this section. The fitting procedure to determine the top 
quark mass is then outlined in Section [TUJ 

To the extent that the simulation used to derive the template distributions is accurate, the 
fit yields a measurement of the true top quark mass. Possible deficiencies of the simulation 
have to be studied and corresponding systematic uncertainties assigned, as described in general 
in Section [TTJ 

To extract mass information from the top quark decay products, any measured quantity in 
the event that is correlated with the mass of the decaying top quark can be used as estimator. 
In £+jets and all-jets events, taking the explicitly reconstructed invariant top quark mass 
as estimator allows to extract the largest statistical information per event. An example for 
the £+jets channel is described in detail in Section 17.11 Alternative estimators have been 
developed to reduce the sensitivity to systematic uncertainties by a careful selection of the 
quantity that is used to determine the top quark mass. For an example of a Tevatron study, 
see Section 17.21 Such techniques will become more important at the LHC with its much 
larger tt data sets. The all-jets decay channel is described in Section 17.31 Dilepton events 
do not allow a full reconstruction of the event kinematics if the top quark mass is assumed 
to be unknown. Therefore, techniques have been developed to determine a likely kinematic 
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configuration by making additional assumptions, and the top quark mass reconstructed for 
this configuration is then used as the estimator, see Section 17.41 

7.1 Full Kinematic Reconstruction of Lepton+Jets Events 

The full kinematic reconstruction of £+jets events with a kinematic fit that assumes a ti event 
configuration yields a fitted top quark mass per event that can be used as estimator of the 
true top quark mass. This technique is most commonly used in template based measurements 
in the £+jets decay channel at the Tevatron. 

The top quark mass information is then mostly based on the measured jets. Both CDF 
and DO determine a reference jet energy scale, including 77 and E T dependence, as described 
in Section I5.2[ and correct jet energies and missing transverse momentum according to this 
scale. With the current Tevatron statistics, measurements in the £+jets channel would be 
systematically limited by the uncertainty on the overall jet energy scale factor JES, unless 
this factor is determined in situ from the same data as well. The reconstructed hadronic W 
mass is highly correlated with the jet energy scale, but not with the reconstructed top quark 
mass. It can therefore be used as an additional estimator to measure an overall deviation from 
the reference jet energy scale. 

As an example, the CDF template measurement [381 EZ] is described in detail here. 

Event Selection: The ti candidate events are selected by requiring the presence of one 
electron or muon, missing transverse energy, and four or more jets, as outlined in Section 14731 
The electron (or muon) must have a transverse energy (or momentum) larger than 20 GeV 
and pass quality and isolation criteria, and missing transverse energy of at least 20 GeV is 
required. The transverse energy requirements on the jets (reconstructed with a cone algorithm 
with radius ATI = 0.4 as described in Section 14.3.3ft depend on the event category described 
below. 

In hadronic W decays almost exclusively light-quark jets (including charm jets) are pro- 
duced; thus 6-tagged jets are likely to be direct top-quark decay products. This analysis does 
not consider 6-tagged jets as W decay products. Thus, the number of jet-parton assignments 
is reduced in events with 6-tagged jets. As it is then more likely that the kinematic fit selects 
the correct assignment, the reconstructed mass distribution becomes sharper, and these events 
contribute more mass information. Furthermore, the signal to background ratio is larger in 
events with b tags. Finally, the jets in background events have mostly low transverse energies. 
Consequently, the events are grouped into four categories with different jet Et cuts to improve 
the statistical error of the measurement. The criteria for the classification are summarized in 
Table H 

Estimator of the Top Quark Mass: Every selected event is subjected to a kinematic fit, 
and the jet-parton assignment that yields the best \ 2 f° r a tt hypothesis is chosen to compute 
the estimator of the top quark mass. The fit uses the reconstructed charged lepton, the four 
highest Et jets, and the unclustered momentum^ as inputs. Of the 24 possible assignments 



4 The presence of an energetic neutrino is signaled by large missing transverse energy. However, the missing 
transverse momentum vector is derived experimentally as the vector sum of the momenta of all energy deposits 
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Category 


2-tag 


1-tag (T) 


1-tag (L) 


0-tag 


b tagged jets 


n b > 2 


n b = 1 


n b = 1 


n b = 


Et of three leading jets 


E T >15 GeV 


E T >15 GeV 


E T >15 GeV 


£ T >21 GeV 


Et of fourth jet 


E T > 8 GeV 


£ T >15 GeV 


E T > 8 GeV 


£ T >21 GeV 


expected S : B ratio 


10.6 : 1 


3.7 : 1 


1.1 : 1 


no constraint used 



Table 2: CDF lepton+jets template measurement 138^ : Event selection requirements for the 
four categories together with the expected ratios of signal and background events used as con- 
straints in the measurement. Events in the 1-tag (T) category are excluded from the 1-tag (L) 
category. 



of four jets to four final-state partons, 12 need not be considered as they correspond to an 
interchange of the two jets assumed to come from the hadronic W decay, yielding identical 
reconstructed invariant masses and thus no change to the x 2 value described below. For each 
of the remaining 12 assignments, two possible solutions for the z component of the neutrino 
momentum exist for a given value of the W mass. Therefore, 24 different combinations need 
to be considered in the 0-tag event category; there are 6 combinations for events with one 
6-tagged jet; for 2-tag events, two combinations exist. 

The following \ 2 is minimized for all different combinations: 



X 



i, fit i, meas\ 2 /Luneli' fit „uncW. meas\ 2 

,p T p T J \D T p T I 



£ - — ^ — -+£ 



i=lepton, 1 3=%,y •> 

4jets 

(mf u -M w ) 2 (mfj-Mw) 2 «-<")' , Wr^f nm 

r 2 + -p 2 + -p 2 + p 2 • (,1UJ 

1 W 1 W 1 t 1 t 

Here, the symbol p\ meas denotes the transverse momenta of the charged lepton and four high- 
est Et jets as measured in the detector, and the cr, are the corresponding uncertainties. The 
jet and lepton angles are assumed to be well-measured and are not varied in the fit. The 
transverse momentum varied in the fit is called p T *. Similarly, the measured unclustered 
momentum components along x and y, their uncertainties, and the fitted unclustered momen- 
tum enter the \ 2 in the second term. The masses of the two W bosons and the two top quarks 
in an event need not be equal, but can vary around the W boson mass, My/ = 80.42 GeV, and 
the reconstructed top quark mass, m£ eco , according to the decay widths. The Breit-Wigner 
resonances are approximated with Gaussians with widths rv and T t . In each step of the 
minimization procedure the calculated masses m fit are compared with My/ and m T t eco . Note 
that the quantity m T t cco extracted from each event is not a direct measurement of the top 
quark mass; rather, this quantity is the estimator whose distribution for all data events is 
then compared with the corresponding distribution in simulated samples. To obtain different 



in the calorimeter, and thus the uncertainty on the missing transverse momentum depends on the jet activity 
in the event. The uncertainty on the unclustered momentum is however approximately independent of the 
rest of the event. Therefore the unclustered momentum is used in the kinematic reconstruction. 
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values for the two neutrino solutions, the two possible values for the neutrino z momentum 
are computed assuming the nominal W mass and are used to initialize the fit. 

For each event, the estimator m\ eco is taken to be the value obtained from the combination 
that yields the smallest y 2 . If this x 2 is greater than 9, the event is rejected altogether from 
the top quark mass fit. The m\ cco distributions for ti signal events with m t = 178 GeV and 
the default jet energy scale are shown in Figure [TBI for all four event categories. It is evident 
that events in the 2-tag category have the best m\ eco resolution. 



"o1000 

% 800 
O 

in 600 

in 

c 400 

,3 200 



o 

> 

O 
lo 
w 
c 
> 

LU 



2-tag 



r (a) 


| | All Events 




RMS = 27 GeV/c? 




■ Corr. Comb (47%) 


li 


I RMS = 13GeV/c 2 







100 150 200 250 300 350 

m t reco (GeV/c 2 ) 

1-tag(L) 



700 
600 
500 
400 
300 
200 
100 








: (c) {[ 


| | All Events 




RMS = 31 GeV/c 2 




■ Corr. Comb (18%) 




RMS = 13GeV/c 2 

L . . . ^~T- ^ .... I . 



100 



150 200 250 300 

m t reco (GeV/c 2 ) 



350 



2000 
O1800 
>1600 
$1400 
"1200 
^1000 
| 800 
« 600 
> 400 
200 




l-tag(T) 



LLI 




| | All Events 

RMS = 32 GeV/o 2 
■ Corr. Comb (28%) 
RMS = 1 3 GeV/c 2 



100 150 200 250 300 350 

2. 



m, reco (GeV/c ) 
0-tag 




| | All Events 

RMS = 37 GeV/o" 
■ Corr. Comb (20%) 

RMS = 1 2 GeV/c 2 



100 150 200 250 300 350 

m[ eco (GeV/c 2 ) 



Figure 18: CDF lepton+jets template measurement [38]: Template m r t eco distributions for 
simulated signal events with m t = 178 GeV and the default jet energy scale in the 2-tag (a), 
l-tag(T) (b), l-tag(L) (c), and 0-tag (d) categories defined in Tabled The dark blue areas 
show the distributions for events where the correct jet-parton assignment has been chosen. 



Estimator of the Jet Energy Scale: To determine an estimator of the jet energy scale, 
no kinematic fit is applied. The masses rrijj of all dijet combinations that do not involve 
a 6-tagged jet are considered. There are between one (events in the 2-tag category) and 6 
(0-tag category) such combinations per event. The rrijj distributions for ti signal events with 
m t = 178 GeV and the default jet energy scale are shown in Figure fT9l 

Template Parametrization: The measurements of the top quark mass and the jet energy 
scale are obtained from an unbinned likelihood fit of the m\ eco and rrijj values computed in 
the data events to the predictions from the simulation. However, samples of Monte Carlo 
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Figure 19: CDF lepton+jets template measurement f3E/: Template m^- distributions for simu- 
lated signal events with m t = 178 GeV and the default jet energy scale in the 2-tag (a), l-tag(T) 
(b), l-tag(L) (c), and 0-tag (d) categories. The dark blue areas show the distributions for dijet 
pairs that correspond to the W decay products. 
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m, reco (GeV/c 2 ) m..(GeV/c 2 ) 

Figure 20: CDF lepton+jets template measurement ISEj : Template distributions for simulated 
signal ti events in the l-tag(T) category; (a) rrf t eco templates for various true top quark masses 
at Ajes = and (b) rrijj templates for various true values of Ajes at m t = 180 GeV. The 
parametrizations of the template distributions are overlaid. 



simulated events are only available for discrete values of the top quark mass and jet energy 
scale. The solution is to describe the m\ eco and rrijj template distributions with functions 
whose parameters depend on the true values of the top quark mass and jet energy scale. It 
is then possible to continuously vary the values of the top quark mass and jet energy scale 
in the fit of the m\ eco and rrijj values from the data sample. The convention adopted in this 
measurement!! is to consider deviations Ajes from the reference jet energy scale in units of its 
uncertainty a c as a function of jet Et and rj, cf. Figure [TH 

Examples of Monte Carlo template distributions of m\ cco and rrijj for signal ti events are 
shown in Figurel2"Ulfor various values of the true top quark mass and jet energy scale. The same 
functional form is used to describe the m r t eco and rrijj templates. It is a linear combination of 
two Gaussians with independent parameters (to account for those cases where the W or top 
quark masses are well-reconstructed, i.e. where the correct combination has been chosen) and 
a gamma distribution (to describe incorrect combinations) . The nine parameters describing a 
template for a given pair of true (m t , A JE s) values are themselves assumed to depend linearly 
on both of these true values. This assumption is justified because in the measurement, the 
values of m t and Ajes need only be varied in a relatively small range since they are already 
known a priori to a certain precision. In Figure [20], the parametrizations of the template 
distributions are overlaid. 

The m\ eco and rrijj template distributions for background events do not depend on the 
true value of the top quark mass. In principle, there is a dependence on the jet energy scale. 
However, it has been verified that a variation of the jet energy scale affects mostly the overall 
number of background events while leaving the shape of the template distributions essentially 
unchanged. The relative contributions of backgrounds from different sources are kept constant, 



5 In the Matrix Element analyses, on the other hand, the relative deviation JES from the reference jet 
energy scale is measured, as described in Section f8. 4.31 
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Figure 21: CDF lepton+jets template measurement \3B$ : Template distributions for simulated 
background events in the l-tag(T) category with the contributions from individual background 
sources; (a) m\ eca templates and (b) rrijj templates. The parametrizations of the template 
distributions are overlaid. 



only the overall background normalization is allowed to vary (constrained by the expectation 
except in the 0-tag category). Therefore, a single m T t eco and rrijj template distribution is 
used to describe the background in each event category The background templates are also 
described with functions, in this case not to obtain a continuous parametrization but to 
become insensitive to statistical fluctuations due to the limited size of simulated background 
samples. As an example, the m\ eco and rrijj template distributions for background in the 
l-tag(T) category are shown in Figure I2"T1 together with their parametrizations. 

7.2 Estimators Independent of the Jet Energy Scale 

The full kinematic reconstruction of ti events in the £+jets channel relies on measurements of 
four jet energies and thus depends on the determination of the jet energy scale, as explained 
in the previous section. Measurements that are completely independent of the jet energy scale 
will be increasingly important for the overall top quark mass combination with decreasing 
statistical uncertainties. The energy of b quarks from top quark decay is correlated with the 
top quark mass; the energy of the bottom hadron and consequently its decay length (distance 
between production and decay vertex) are then also correlated with the top quark mass. The 
CDF collaboration has used the decay length in the plane perpendicular to the beam axis as an 
estimator in a top quark mass measurement [68]. This measurement is independent of the jet 
energy scale (except for the fact that secondary vertices are only looked for in reconstructed 
jets), but has in principle a larger dependence on modeling of 6-quark fragmentation than 
measurements with full reconstruction of the ti event kinematics. 

The measurement is based on an event sample that is triggered with an inclusive lepton 
trigger and selected requiring the presence of an isolated charged lepton with Et > 20 GeV, 
missing transverse energy Jf? > 20 GeV, and at least three jets with Et > 15 GeV, at least 
one of which is 6-tagged by the presence of a secondary vertex, as explained in more detail 
in Section 14.31 Note that it is not necessary to require the presence of four reconstructed 
jets, as a full reconstruction of the final state is not attempted. The signed decay length 
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Figure 22: CDF lepton+jets measurement using the decay length technique W$J : L xy dis- 
tribution (from jets with L xy > 0) in the data (points with error bars) together with the 
expected contributions from signal and background processes as explained in the plot. The 
simulated distributions have been normalized to the observed number of events. The result of 
a Kolmogorov-Smirnov test is indicated in the plot. 



L xy corresponding to a secondary vertex is calculated as the vector from the primary to the 
secondary vertex, projected first onto the jet momentum vector and then into the xy plane 
perpendicular to the beam axis. 

The measurement is based on the mean decay length (L xy ) in all jets with a secondary 
vertex with L xy > in the selected sample. To check the modeling of the decay length, 
the L xy distributions in doubly 6-tagged dijet events (which contain mostly fe-quark jets) and 
events that contain at most two jets but otherwise pass the tt event selection (which are 
depleted in fe-quark content) are compared between data and simulation. Decay length values 
in the simulation are scaled by the ratio between the mean values (L xy ) found in the data 
and simulation. In Figure [221 the distribution of positive L xy values measured in the data is 
shown together with the expected contributions from signal and background events after the 
scaling procedure. 

The expected distribution of the mean decay length, (L xy ), is then obtained from pseudo- 
experiments using simulated events for true values of the top quark mass between 130 and 
230 GeV. The (L xy ) values are fitted with a third degree polynomial as a function of m t , and 
this parametrization is used to obtain the measurement of m t from the value of (L xy ) observed 
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in the data sample. 



7.3 Estimators in the All-Jets Channel 

As the £+jets channel, the all-jets channel also offers the possibility of full reconstruction of 
the event kinematics; the challenge here is the large background from QCD multijet events, 
and also the combinatorial background. 



Event Selection: In the CDF template analysis in the all-jets channel |13J ES], events are 
used if they pass a multi-jet trigger, contain no isolated energetic leptons and no significant 
missing transverse energy, and between 6 and 8 jets with Et > 15 GeV, \rj\ < 2.0, and 
ATI > 0.5 between the jets. The output of an artificial neural network trained to identify 
tt events is used to further reduce the contribution from QCD multijet background. It is 
calculated from the following inputs: 

• The scalar sum of transverse energies of all jets in the event, and the scalar sum of 
transverse energies of all but the two highest-i^ jets; 

• the centrality C = Ht/v§, where is the invariant mass of the event calculated from 
the reconstructed jets, and the aplanarity A = |Ai; the symbol Ai denotes the smallest 
of the three eigenvalues of the normalized momentum tensor 



where p a is the reconstructed momentum vector of jet a, and i and j are Cartesian 
coordinates; 

• the minimum and maximum dijet and trijet masses; 

• the quantity E^* = E\ sin 2 9\, where E\ is the transverse energy of the highest--Er jet in 
the event, and 9\ denotes its polar angle in the all-jets rest frame, and the corresponding 
quantity for the second-highest--Er jet; and 

• the geometric average over the E^ values for all but the two highest-Er jets in the event. 
The distribution of neural network output values, NN, is shown in Figure I2U1 for the data and 
the expected tt contribution. Events are selected if they satisfy iViV > 0.91. The background 
is further reduced in the subsequent analysis because only events with at least one 6-tagged 
jet are used. 



Estimator of the Top Quark Mass: In every selected event, the 6 highest--Sr jets are 
assumed to be tt decay products and used to reconstruct the event in a kinematic fit to the 
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Figure 23: CDF all-jets template measurement 16$/ : Distribution of neural network output 
values NN for data events (points with error bars) and the expected contribution from ti 
events (solid histogram). 
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tt hypothesis similar to the one described in Section 17.11 with a \ 2 given by 



_ , fit i, mcas N 

x 2 = J2 



i=6 jets 



a? 



(m% ctl -M w y {m% ct2 -M w y (mf* jctl - mff fe.-mrf 

1 W L W L t L t 



Each 6-tagged jet is considered in turn as a b jet (with specific 6-jet energy corrections ap- 
plied), jet-parton combinations that assign it to a W decay product are rejected, and the 
reconstructed top quark mass m\ eco corresponding to the combination that yields the best x 2 
is taken as estimator. Combinations with x 2 > 16 are rejected. There are thus up to n tag 
estimators in an event with n tag tagged jets. 



Template Parametrization: Similar to the procedure described in Section 17.11 the m\ eco 
template distributions for various input top quark masses are fitted with functions whose 
parameters depend on the true value of the top quark mass. Examples of Monte Carlo 
template m\ eco distributions for signal ti events are shown in Figure 1247 a) for various values 
of the true top quark mass. The same functional form as in Section 17.11 is used to describe 
the templates, with linear dependence of the parameters on the true top quark mass. 

The background template is derived from the data. The jet tagging probability is obtained 
from a signal-depleted sample of events with exactly four jets. The background shape is then 
determined from the ti candidate event sample by weighting each jet in turn by its tagging 
probability, rather than imposing an actual 6-tagging requirement. The signal contribution 
to this background template estimate is obtained using the simulation and subtracted. The 
procedure for determining the background shape is validated using events with low neural 
network output iViV. The background template, shown in Figure 1247 b). is parametrized with 
two gamma functions plus one Gaussian. 



7.4 Estimators in the Dilepton Channel 

As explained in Section 13.21 dilepton events are kinematically underconstrained if the top 
quark mass is not assumed to be known. It is therefore not possible to use full reconstruction 
of the event kinematics to obtain an estimator m r t eco as in the £+jets or all-jets channels, cf. 
Sections 17.11 and 17731 Nevertheless, for a given selected dilepton event, some top quark mass 
hypotheses are still more likely than others, and this allows a top quark mass measurement. 
To determine the relative likelihoods of different top quark mass assumptions, an integration 
is performed over undetermined kinematic quantities of the event. Various methods have been 
developed for this integration and for obtaining a top quark mass estimator; in the following, 
these methods are described in turn. The event selection criteria for all analyses are based 
on the general topology of a dilepton ti event as described in Section 14.4.11 Typically, two 
oppositely-charged isolated energetic leptons inconsistent with the Z — > hypothesis, at 
least two energetic jets, and significant missing transverse energy are required, and the two 
highest- E T jets are assumed to be ti decay products. 
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Figure 24: CDF all- jets template measurement 169]: Template m\ eco distributions for (a) signal 
tt events for various true top quark masses and (b) background events. The parametrizations 
of the template distributions are overlaid. 



Neutrino Weighting Method: This method has been used by both CDF and DO in 
Run I [70] and Run II [7TJ [72]. Since the b and b jets are not distinguished in the reconstruc- 
tion, there are two possible jet-parton assignments, corresponding to two possible jet-lepton 
pairings per event. For each pairing, a scan over assumptions for m t and the (anti-)neutrino 
pseudorapidities r\ v and ij^ is performed. Disregarding the measured missing momentum in the 
event, the event kinematics are reconstructed for each assumption by imposing a tt event hy- 
pothesis. This leads to four solutions per (m f , T) v , rfc) assumption. According to the measured 
missing transverse momentum f?, each solution i is assigned a weight Wi of 

wi = exp ! JA-p^-p,,) 2 ^ exp (_ (A-p^-p*,»f , t (13) 

where (a x , a y ) denotes the missing transverse momentum resolution, and p u and p-y are the 
neutrino momenta obtained for the given solution. For a given assumption, the four solutions 
have equal a priori probability; the assumption can thus be assigned a weight of 

4 

w{m u rj u , rfc, pairing) = . (14) 

i=i 

The a priori probabilities of the neutrino and antineutrino pseudorapidities are determined 
from simulated tt events. They are uncorrelated and can be described by a Gaussian centered 
around zero whose width is nearly independent of the true top quark mass (cf. Figure 125]) . 
To obtain the weight as a function of the top quark mass alone, a scan over the unknown 
values of r\ v and ijy is performed, and the corresponding weights are multiplied with the a 
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priori probabilities P (n u , rfc) of the neutrino and antineutrino pseudorapidities. Finally, as 
both jet-lepton pairings also have the same a priori probability, one obtains 

w(m t ) = P (^> Vv)™ (m, Vu, ifc, pairing) . (15) 

pairings rj v , rfc 

While the weight distribution w(m t ) for a single event can have more than one relative max- 
imum, the average weight distribution for many simulated events has one maximum close to 
the true top quark mass, as shown in Figure 1261 
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Figure 25: CDF dilepton neutrino weighting template measurement fTTj '- The distribution 
of neutrino pseudorapidities rj with a Gaussian fit (upper left) and the correlation with the 
antineutrino pseudorapidity (upper right), determined from simulated ti events with m t = 
178 GeV using herwig Monte Carlo. The width of the fitted Gaussian as a function of m t is 
shown in the lower plot; here, the line indicates the width for mt — 178 GeV. 



Both CDF [7T] and DO [72J have performed measurements where the mt value that max- 
imizes the weight distribution is taken as the top quark mass estimator for a given event. 
These analyses then proceed as described above in Section 17.11 The templates are fitted as a 
function of the most likely top quark mass, and for the signal templates, the dependence of 
the fit parameters on the true top quark mass is parametrized as well [71] or included in a 
two-dimensional fit of the templates as a function of the estimator value and the true input 
top quark mass [72]. 

This procedure where the m t value that maximizes the weight is taken as estimator does 
not take into account that events with a broad maximum contain less m t information than 
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Figure 26: Dilepton neutrino weighting template measurements: (a) The weight distribution 
as a function of assumed top quark mass for one ti event with m t = 170 GeV simulated 
using herwig, reconstructed at CDF [71]/. The top quark mass value taken as estimator for 
this event is indicated by the vertical line, (b) The average weight distribution (obtained by 
summing weights for many simulated events) for ti events with m t = 175 GeV generated with 
pythia, reconstructed at DO f7H| /. 



events where the maximum is strongly peaked. DO has therefore made measurements that 
use a vector of estimators. This vector either contains the weight histogram integrated in a 
few coarse bins or the mean and RMS of the weight distribution [72] . These multidimensional 
estimators cannot be fitted like in the one- dimensional case; therefore, the signal and back- 
ground probability densities for a given vector of estimators are determined from the density 
and weights of simulated events with estimators that have nearby values in estimator-space. 
A parametrization of probability densities as a continuous function of assumed top quark 
mass is not performed, either. The gain from these methods is modest: DO finds that relative 
to the analysis that uses as estimator the m t value that maximizes the weight, the expected 
uncertainty decreases by 7% when using five bins, but even increases by 5% when using the 
mean and RMS [72] • Rather than introducing multidimensional estimators for the same quan- 
tity, a more natural approach to extracting more information from the events is to go beyond 
template methods altogether, as described for example in Section [HJ 

Neutrino <fi Weighting Method: Instead of assuming the pseudorapidities rj for both 
neutrino momenta, one can assume values of their azimuthal angles. This procedure is used 
by CDF [71J. A top quark mass estimator for a selected event is obtained via the following 
steps: 

• For each assumed pair of neutrino <p values, the event kinematics is reconstructed in 
a kinematic fit assuming a ti event that constrains the lepton-neutrino pairs to the W 
boson mass and constrains the two top quarks to have equal masses within the top quark 
width. For each pair, there are 8 solutions arising from the two-fold ambiguity in solving 
for the neutrino longitudinal momentum and from the two lepton-jet pairings, and the 
fitted top quark mass from the solution with the smallest y 2 is taken. 
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• A 12 x 12 grid of assumed neutrino values is tested. The top quark mass calculated 
for each given pair of assumed neutrino </> values is weighted by its \ 2 probability. The 
weighted average of the top quark masses is then taken as estimator for the event, where 
only assumptions with a weight of at least 30% of the maximum weight in the event are 
considered. 

Full Kinematic Analysis: A third method employed by CDF uses an assumed value p z (ti) 
of the longitudinal momentum component of the ti system to solve the event kinematics |71j . 
The p z (tt) distribution is expected to be centered around zero with a width of 180 GeV, where 
the width changes by 10% when varying the top quark mass between 140 and 200 GeV. There 
are two possible jet-lepton pairings, each with up to four different solutions to the kinematic 
equations for a given assumed value of p z (ti). The up to eight possible solutions for the top 
quark mass under the assumption of a ti event are calculated. This procedure is repeated 
10000 times, with p z (ti) values drawn from the expected distribution, and with the measured 
jet energies and the missing momentum varied within their resolutions. Of the four most 
probable values corresponding to the four solutions for a given jet-lepton pairing, the one 
yielding the smallest ti invariant mass is retained, if any solution was found. 

From the resulting distributions of top quark masses for each of the two jet-lepton pairings, 
the one pairing is chosen for which the number of trials that yielded no solution is smaller. 
The most probable value of the corresponding distribution of top quark mass solutions is taken 
as the estimator. The procedure slightly favors lower top quark masses; however this is valid 
since only an estimator for the top quark mass is desired, and the bias introduced is corrected 
for when the method is calibrated using simulated events. 

Matrix Weighting Method: In the Matrix Weighting technique employed by DO [73] the 
kinematics of the ti candidate event is solved by assuming a value for the top and antitop 
quark masses. There are four solutions to the kinematic equations for a given jet-lepton 
pairing. The weight for a given solution is computed from the proton and antiproton parton 
distribution functions /pdf and /pdf and the probability p(E^\m t ) of a charged lepton £ to 
have energy E\ in the top quark rest frame as 



where x and x denote the momentum fractions of the colliding partons in the proton and 
antiproton. To compute the total weight for a given m t assumption, the weights for both 
jet-parton assignments and all solutions are summed. Using resolution sampling, the above 
procedure is repeated many times with reconstructed energies/momenta and the missing trans- 
verse momentum drawn from distributions according to the detector resolution, and for each 
assumed top quark mass, the mean total weight is determined. The value of the assumed top 
quark mass where the mean total weight reaches its maximum is then taken as the estimator 
for the measurement. 




(16) 
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8 The Matrix Element Measurement Method 

The previous section described different possibilities for computing an estimator of 
the top quark mass in each ti candidate event, and how the expected estimator distri- 
bution as a function of the assumed top quark mass can be used in a top quark mass 
measurement. In this section, a different measurement strategy is described, where 
for each selected event a likelihood as a function of the assumed top quark mass is 
calculated. The section starts with the definition of this likelihood, then describes the 
parametrization of the detector resolution needed to compute it, and continues with 
an in-depth discussion of how the likelihood is calculated for the various (signal and 
background) processes via which a candidate event may have been produced. 

The Matrix Element method is based on the likelihood to observe a given event in the 
detector, calculated as a function of assumed top quark mass. The Matrix Element method 
was first used by the DO collaboration for the measurement in the £+jets channel at Tevatron 
Run I [65], where it yielded the single most precise measurement of the top quark mass. 
The method has been applied to the measurement in the £+jets channel at Run II by both 
CDF [66] and DO [39], and CDF has also used it in the dilepton channel [H [73 [76]. The 
Dynamical Likelihood method follows a similar concept. It has been used by CDF in the 
£+jets [STJ and dilepton [77] channels at Run II and is described in this section together with 
the Matrix Element method. 

The selection of events used in the analyses is briefly described in Section IHTT1 An overview 
of the calculation of the event likelihood is given in Section IH72| and the calculation of the like- 
lihood for a given process is explained in Section ET3"1 Section 15^41 discusses the parametrization 
of the detector response. This includes a description of how 6-tagging information can be used 
in the analysis. Technical details on the computation of the signal and background likelihoods 
are given in Sections 18.51 and 18.61 with an emphasis on the DO Run II analysis in the £+jets 
channel which serves as an example. 

8.1 Event Selection 

The selection of events for the measurements with the Matrix Element method is very similar 
to the general criteria described in Sections 14.4.11 and 14.4.21 and to the selections used in 
measurements based on the template method, see Section [7J 

There is, however, one aspect that deserves special attention. Leading-order matrix ele- 
ment calculations are used to evaluate likelihoods with which the selected events are produced 
in the signal and background processes. Initial- and final-state gluon radiation are therefore 
not accounted for in these likelihoods. This is not a problem since the calibration of the 
measurement is based on fully simulated events which do include gluon radiation, cf. Sec- 
tion 110.21 Nevertheless, both CDF and DO select only events with exactly four jets for their 
Matrix Element measurements in the £+jets channel. While not completely removing events 
with significant gluon radiation, this cut still reduces their contribution to the event sample. 
Also, the complication of selecting the jet from radiation and assigning it either to initial-state 
radiation or to final-state radiation off one of the ti decay products is avoided. On the other 
hand, the measurements in the dilepton channel are more severely limited by statistics. Here 
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the event selection requires two or more jets, as for the template analyses, and any jets but 
the two highest--&r ones are assumed to be due to initial-state radiation. 

In the £+jets channel, the DO experiment has performed both a topological and a b- 
tagging measurement (i.e. disregarding/using 6-tagging information); in both analyses, all 
events (irrespective of the number of b tags) are used. Unless noted explicitly, the following 
description refers to the 6-tagging analysis. In contrast, the CDF event selection in the £+jets 
channel requires at least one jet to be 6-tagged; this selection is used in the Matrix Element and 
also the Dynamical Likelihood measurement. In the dilepton channel, CDF has performed two 
measurements based on the Matrix Element method, one not using 6-tagging information and 
the other requiring at least one 6-tagged jet. The dilepton measurement with the Dynamical 
Likelihood technique does not use 6-tagging information. 

8.2 The Event Likelihood 

To make maximal use of the kinematic information on the top quark mass contained in the 
event sample, for each selected event the likelihood L evt that this event is observed is calculated 
as a function of the assumed top quark mass. In analyses where additional parameters like 
the overall jet energy scale are to be measured simultaneously with the top quark mass, L evt 
is also a function of the assumed values of these parameters. The likelihoods for all events 
are then combined to obtain the sample likelihood, and the measurement of the top quark 
mass and of the other parameters, if applicable, is extracted from this sample likelihood. To 
make the likelihood calculation tractable, simplifying assumptions in the description of the 
physics processes and the detector response are introduced as described in this section. Before 
applying it to the data, the measurement technique is however calibrated using fully simulated 
events, and the assumptions the full simulation makes to describe the physics processes are 
accounted for by systematic uncertainties. 

It is assumed that the physics processes that can lead to an observed event do not interfere. 
The likelihood L evt then in principle has to be composed from likelihoods for all these processes 
as 

processes P 

where L P is the likelihood for the event to be created via a given process P, and f P denotes the 
fraction of events from that process in the event sample. In practice, not all possible processes 
can be accounted for explicitly, and simplifying assumptions are made in the calculation of the 
likelihoods for the individual processes. The measurement result therefore has to be corrected 
accordingly. 

A likelihood L t i is calculated for the event to be produced in the signal tt reaction; this 
likelihood will depend on the assumed top quark mass. The tt production processes taken 
into account are listed in Table [3J In their measurements using the Matrix Element method, 
CDF and DO have also included likelihoods for the event to be produced via the dominant 
background processes; this maximizes the separation between signal and background events 
and keeps corrections to the final result small. In contrast, the CDF analyses using the 
Dynamical Likelihood technique omit an explicit treatment of background at this stage and 
apply a correction for all backgrounds to the final result. When applying the Matrix Element 
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method to the £+jets channel, CDF and DO choose to determine the jet energy scale and the 
signal fraction together with the top quark mass. 
The event likelihood L evt can thus be expressed as 

L cvt (x; m t , JES, f t t) = ftfLtt m, JES) + (1 - f a ) L hkg (x; JES) , (18) 

where L bkg is a weighted sum of likelihoods for all background processes according to Equa- 
tion (I17p . The symbol x denotes the kinematic variables of the event, f t t is the signal fraction 
of the event sample, and L t i and Lbk g are the likelihoods for observing the event if it is pro- 
duced via the ti or any of the background processes, respectively. The values of f t i and JES 
are fixed if applicable, depending on the details of the analysis. An overview of the event 
likelihood calculation in the analyses described here is given in Table |3j 



channel 


method 


exp. 


parameters 


signal 

processes 


background 
processes 


reference 


£+jets 


ME 


DO 


mt, JES, f a 


qq — > ti 


W+Ap 


m 


£+jets 


ME 


CDF 


mt, JES, f a 


qq — > ti 


W+Ap 


[66J 


£+jets 


DL 


CDF 


m t 


qq -> tt, gg -> ti 






dilepton 


ME 


CDF 


m t 


qq — > ti 


Z/l*+2p, 
WW+2p, 
W+3p 


HUES] 


dilepton 


DL 


CDF 


m t 


qq -> tt, gg -> ti 







Table 3: Overview of the L evt calculation in the m t measurements using the Matrix Element 
(ME) and Dynamical Likelihood (DL) methods. The column entitled "parameters" lists the 
quantities that are measured in the analysis, and the signal and background processes taken 
into account in the event likelihood are listed in the following columns. The symbol "p" refers 
to any light parton, i.e. a u, d, s, or c quark (or antiquark) or a gluon. The lines describing 
the Matrix Element measurements by DO in the £+jets channel and by CDF in the dilepton 
channel refer to both the topological and the b-tagging analyses (the WW+2p background is 
negligible and not considered in the dilepton measurement using b tagging). 

To extract the top quark mass from a set of iV measured events X\,..,x^, a likelihood 
function for the event sample is built from the individual event likelihoods calculated according 
to Equation (fl8"|) as 

N 

L(x 1 ,..,x N ; m t , JES, fti) = JjL cvt (xi; m t , JES, f t t) ■ (19) 

i=l 

This likelihood is maximized to determine the top quark mass (and additional parameters, if 
applicable). 

8.3 The Likelihood for one Process 

To evaluate the likelihood for an observed event to be produced via a given process P, all 
possible configurations y of the four-momenta of the final-state particles that could have led 
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to the observed event x are considered. In practice, also a sum over different non-interfering 
processes is performed; thus only one likelihood is computed for different color or flavor 
configurations if the differential cross section is identical. The likelihood for a final state 
with rif partons and given four-momenta y to be produced in the hard-scattering process is 
proportional to the differential cross section dap of the corresponding process, given by 

(27r) 4 |^#p (aid2 — > v)\ 2 , , , \ 

da P ( ai a 2 ^y) = 1 ; 1 V/ — d$ n/ , 20 

where a\a 2 and y stand for the kinematic variables of the partonic initial and final state, re- 
spectively. The symbol ^p denotes the matrix element for this process, s is the center-of-mass 
energy squared of the collider, £i and £2 are the momentum fractions of the colliding partons 
cl\ and a 2 (which are assumed to be massless) within the colliding proton and antiprotorj§, 
and d$ n/ is an element of n^-body phase space. 

To obtain the differential cross section dap(pp — > y) in pp collisions, the differential cross 
section from equation (120]) is convoluted with the parton density functions (PDF) and summed 
over all possible flavor compositions of the colliding partons, 

da P (pp^y)= J £d&d& f^i) fphF(^)dap( ai a 2 ^y) , (21) 



where /pdf(0 an d 7pdf(0 denote the probability densities to find a parton of given flavor a 
and momentum fraction £ in the proton or antiproton, respectively. This equation directly 
reflects QCD factorization as introduced in Section [37T1 and corresponds to Equation (jSJ). 

The finite detector resolution is taken into account via a convolution with a transfer func- 
tion W(x, y\ JES) that describes the probability to reconstruct a partonic final state y as 
x in the detector. The differential cross section to observe a given reconstructed event then 
becomes 



dcrpipp — >■ x) = J dap{pp y) W(x, y; JES) 



(22) 



Only events that are inside the detector acceptance and that pass the trigger conditions 
and offline event selection are used in the measurement. Because of the selection cuts, the 
corresponding overall detector efficiency depends both on m t and on the jet energy scale. This 
is taken into account in the cross section of events observed in the detector: 

<xf = J da P (pp -> y)W(x,y; J ES)f &cc {x)dx , (23) 
where / acc = 1 for selected events and / acc = otherwise. 



6 The following discussion is based on situation at the Tevatron pp collider as a concrete example but is 
equally valid for the LHC when the antiproton is replaced with a proton and the appropriate PDF is used. 
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Figure 27: Schematic representation of the calculation of the likelihood for ti production to 
lead to a given observed i+jets event. The observed event x, shown in red at the right, is fixed 
while integrating over all possible momentum configurations y of final-state particles ( shown 
in blue). All possible assignments of final- state partons arising from the process shown by the 
Feynman diagram to the measured jets in the detector are considered (only one possibility is 
shown here). The differential cross section for the process shown by the diagram is convoluted 
with the probability for the final-state partons to yield the observed event (transfer function), 
and with the probability to find initial-state partons, shown in green to the left, of the given 
flavor and momenta inside the colliding proton and antiproton (parton distribution function) . 
For each partonic final state under consideration, the initial state parton momenta are known 
by energy and momentum conservation. 



For example, the likelihood to observe a tt event as x in the detector is given by 



da t i(pp — > x; m t , JES) 
Ltt[x; m u JES) = ^ 



a^(m u JES) 

t(m),JES) J E^/pV(^)/p DF (6)x (24) 
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JES) 



The contributions to the likelihood are visualized schematically in Figure |2~T1 A similar formula 
holds for the likelihood to observe a background event as x, except this likelihood does not 
depend on the top quark mass. 

Details of the parametrization of the detector response are given in Section 18.41 The 
parametrization of the matrix element and the computation of L t i are described in Section 1831 
The determination of L bkg is discussed in Section 18.61 
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8.4 Description of the Detector Response 

The transfer function W(x, y\ JES) relates the characteristics y of the final-state partons to 
the measurements x in the detector. The symbol x denotes measurements of the jet and 
charged lepton energies or momenta and directions as well as 6-tagging information for the 
jets. A parametrization of the detector resolution is used in the likelihood calculation because 
the full GEANT-based simulation would be too slow. The full simulation is however used to 
generate the simulated events with which the method is calibrated. In this section, the general 
form of the transfer function is first described, followed by a discussion of the individual factors. 

8.4.1 General Form of the Transfer Function 

The transfer function W(x,y; JES) describes the probability density dP to reconstruct a 
given assumed partonic final state y as x in the detector: 



Because the final-state partons are assumed to give rise to some measured event x, the nor- 
malization condition 



holds, where the integral is over all possible observed events x. 

The transfer function is assumed to factorize into contributions from each measured final- 
state particle. Aspects to be considered in the transfer function are in principle the measure- 
ment of the momentum of a particle (both of its energy and of its direction) as well as its 
identification. 

The relative importance of the energy resolution for electrons, muons, and quarks for the 
top quark mass reconstruction has been studied qualitatively in simulated top quark decays 
with a leptonic W decay, reconstructed with the DO detector. Of the three top quark decay 
products, the reconstructed momentum from either the charged lepton or the b jet is taken, 
while the true values are taken from the simulation for the other two particles. The three 
plots in Figures fToTa). (b), and (c) show qualitatively that the resolution of the top quark 
mass is dominated by the jet energy resolution, while the effect of the electron resolution is 
comparatively small. The effect of the muon resolution at high muon transverse momentum is 
of the same order as that of the jet resolution, while muons are comparatively well-measured 
at low pt, as shown in Figures [T5Tbl)-(b6). 

Consequently, the following assumptions are made about how final-state particles are mea- 
sured in the detector, which allow reducing the dimensionality of the integration over 6-particle 
phase space described in Section IHU1 

• Electrons: Apart from efficiency losses (which are not described by the transfer func- 
tion), electrons are assumed to be unambiguously identified (i.e. an electron is not re- 
constructed as a muon or a hadronic jet). The electron direction and energy are both 
assumed to be well-measured, i.e. during integration, the final-state electron is assumed 
to be identical to the measured particle. 



dP = W(x,y; JES)dx . 



(25) 




(26) 
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Muons: As for electrons, muons are assumed to be unambiguously identified, and their 
direction to be precisely measured. While CDF also considers the muon p? to be well- 
measured, DO introduces a transfer function that allows for a finite resolution. This has 
primarly an effect for muons with large Pt- 

Energetic quarks and gluons: Energetic quarks and gluons are almost always re- 
constructed as a jet in the detector. There is however a small probability that they 
are reconstructed as an isolated fake lepton, and it depends on the process considered 
whether or not this possibility has to be taken into account: If the number of energetic 
leptons required in the event selection is already present in the final state, the possiblity 
that this lepton is not reconstructed and a quark/gluon fakes a lepton that is selected 
can be ignored. If fewer energetic leptons are produced than required in the selection, it 
is assumed that the signatures of the remaining leptons in the detector have arisen from 
quarks or gluons. This effect becomes important in the case of dilepton events, where 
background from iy+3p production is explicitly taken into account. 
Both DO and CDF assume the direction of the final-state quarks and gluons to be well- 
measured by the jet directions. Transfer functions are introduced for the jet energy (or 
transverse energy). 

The ability of the detector to distinguish quarks and gluons and to identify the flavor 
of final-state quarks is limited. The probability to obtain a 6-tagged jet is largest for 
b quarks, and it is still larger for c quarks than for light quarks or gluons. While the 
DO measurement takes into account the possibility that a b tag is faked by a c or light 
quark or gluon, CDF makes the approximation that this probability is zero for signal 
events where enough b quarks are present in the final state. This has consequences for 
the assignment of final-state quarks to measured jets in £+jets tt events, where there 
are two b quarks and two light (or charm) quarks in the final state. In dilepton tt events 
with only two b quarks in the final state, this complication does not arise. 

Neutrinos: Neutrinos are not measured in the detector. The presence of energetic 
neutrinos can be inferred from the transverse momentum imbalance {fr)] however this 
quantity depends on the momenta of the other objects measured in the detector. The 
observed missing transverse momentum is due to the neutrinos as well as to mismea- 
surement of jet energies and of the energy -S^recoii 

from other objects in the detector 

against which the tt system recoils. 

In the Matrix Element analyses, the tt transverse momentum is assumed to be zero, 
and the transverse components of the sum of neutrino momenta is obtained as the 
negative vector sum of all other assumed final-state particle transverse momenta. The 
dilepton measurement includes a transfer function factor that describes the likelihood 
with which the measured value of -&r,rccoii is obtained if the assumed value is zero. The 
Matrix Element measurements in the £+jets channel do not include this factor. 
In the Dynamical Likelihood measurements, the transverse components of the sum of 
neutrino momenta are taken as the negative vector sum of all other assumed final-state 
particle transverse momenta and the unclustered transverse momentum in the calorime- 
ter; consequently the tt transverse momentum is assumed to be minus the unclustered 
transverse momentum and not necessarily zero. No transfer function factor for the 
unclustered transverse momentum is included. 
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In addition to the energy resolution, one has to take into account the fact that the jets in 
the detector cannot be assigned unambiguously to a specific final-state parton. (Similarly, it is 
not known which reconstructed electron was faked by which final-state parton, if applicable.) 
Consequently, all possibilities must be considered in principle, and their contributions to the 
transfer function summed. 

If no 6-tagging information is used, the transfer function W(x,y; JES) is given by 



W(x,y;JES) = f[ 6®(p?° - pf) * (27) 

e=l 

n 5(2) K cc - *r) w » {(qmt > wpt)?) x 

m=l 

-i "comb n 3 

— E II 5(2) fe -^onk) (E™, E^ tonk] JES) x 



^comb • -, , 



II Recoil " (-Ptt%) , 



where the four lines represent the contributions from electrons, muons, jets, and the recoil 
energy of the tt system, respectively. It is understood that a term only appears if the cor- 
responding particle appears in the final state under consideration. The symbols n e , n M , and 
rij stand for the numbers of electrons, muons, and jets in the final state, and e, m, and j 
stand for a specific reconstructed particle. The number of possible assignments of jets j to 
final-state partons k is denoted by n com b, and i stands for one specific permutation. The sum- 
mation over reconstructed jets j implies a sum over final-state partons k. The reconstructed 
(rec) and assumed (ass) values of the energy E and momentum vector p, the unit vector u 
along the direction of the momentum, and the charge q of a particle enter the transfer func- 
tion. The terms describing the muon and jet resolution are parametrized in {q/pr)^ and the 
jet/parton energy, respectively, since these are the quantities measured in the detector. For 
the tt recoil, the x and y components are assumed to be independent. An additional term is 
added corresponding to a sum over all possibilities for final-state partons faking an electron, 
if applicable. In Equation ( 127)) it is assumed that reconstructed charged leptons can be unam- 
biguously assigned to final-state leptons; this is justified even in the dilepton channel because 
both jet-parton assignments are considered. 

If no 6-tagging information is used, the information from the reconstructed jet momentum 
vectors determines the relative weight of different jet-parton assignments for a given partonic 
final state. The inclusion of 6-tagging information allows for an improved identification of the 
correct jet-parton assignment in final states like £+jets tt events that contain b quarks as well 
as light partons. This can be encoded in the jet transfer function by an additional factor Wb 
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for each jet: 

W(x, y- JES) = Y[ 5(3) (pT - PD x (28) 



e=l 
m=l 
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The quantity Wb describes the probability for parton k with given assumed flavor 0partonfc to 
be reconstructed with fe-tagging information B™tj- If b tagging is used as a binary decision (of 
a jet to be 6-tagged or not to be 6-tagged) as is the case in the analyses described here, then 
one simply has 

f tb tertonfc) if the J et 3 is 6-tagged and 
W b <p- tonk ) = (29) 

I 1 ~ e b Op^ton k ) otherwise, 

where e& (0) is the 6-tagging efficiency for a jet from a parton of given flavor 0. 

In principle, the transfer function can still depend on the top quark mass. This comment 
applies in particular to the term describing the jet transfer function: The event topology and 
thus the angular separation between the jets depends on the top quark mass. For example, 
the probability of misassignment of particles to the wrong jet can therefore slightly depend 
on the top quark mass. A study of the m t dependence of the transfer function is described 
in [5?]. None of the analyses described here parametrize the mt dependence explicitly in the 
transfer function; instead, the analyses rely on the calibration with fully simulated events to 
correctly account for any such dependence in the measurement on data. 



8.4.2 Simplifying Assumptions 

In general, the analyses do not use the full transfer function given in Equation (128]) but 
introduce further simplifications. These are discussed in this section. 



Muon Transfer Function: The CDF analyses treat the muon transverse momentum as a 
well- measured quantity, similar to the electron energy. This is justified since muons at very 
high pt such that they have a sizeable effect on the top quark mass resolution consistute 
a small fraction of the sample (cf. Figures IT5T bl)-(b6)). This assumption will lead to an 
increased pull width (since the uncertainty on the muon px is set to zero) to be accounted 
for in the calibration, and to a slightly increased measurement uncertainty (since the relative 
weight of events with and without a high-p^ muon is non-optimal). 
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Jet Transfer Function: The hadronization process depends on what kind of parton initi- 
ates a jet. The analyses use the same transfer function to describe light-quark (u, d, s, and 
c) and gluon jets; an independent transfer function is used for b jets. The DO experiment 
further distinguishes between b jets that contain a reconstructed (soft) muon and other b 
jets: The muon is taken as an indication for a semimuonic bottom- or charm-hadron decay, 
and the special transfer function allows to account on average for the energy carried by the 
unreconstructed neutrino. Semielectronic decays or semimuonic decays where the muon is 
not identified are not treated explicitly and still have to be accounted for on average by the 
generic 6-quark transfer function. 



Treatment of 6- Tagged Jets: The 6-tagging efficiency is much larger for 6-quark jets than 
for jets from light quarks or gluons. The CDF collaboration therefore makes the assumption for 
the calculation of the likelihood in their £+jets analyses that a 6-tagged jet always corresponds 
to a b quark, if b quarks are present in the final state. Since CDF requires at least one b- 
tagged jet in the event, the computation time is significantly reduced since fewer jet-parton 
assignments remain to be considered. This assumption corresponds to the approximation that 
the 6-tagging efficiency for jets without a b quark is zero, and Equation (1291) becomes 



( ^.jetj > 0parton k ) 



for 6-tagged light-quark/gluon jets, 
eJb) for 6-tagged fe-quark jets, 

(30) 

1 for untagged light-quark/gluon jets, and 
1 — €b(b) for untagged 6-quark jets. 



For the remaining jet-parton assignments, the overall factor ]X=i W b (^jetj> ^partem k) * n the 
signal likelihood is then 



n 



W (B iec </> ass ) — / e6 ^) x (l ~~ e fo (6)) x 1 x 1 for events with one 6-tagged jet and 
b\ jet j) s^partonfc^ ye b (b)x e b (b) xlxl for events with two 6-tagged jets, 



(31) 

where the two factors for the two jets that are assumed to originate from b quarks are given 
first. For both single- and double-tagged events, this product is almost identical for all jet- 
parton assignments considered. In the measurement using the Dynamical Likelihood method, 
in which only the signal likelihood is considered, this yields one multiplicative scale factor for 
each event likelihood. Such a scale factor is irrelevant for the m t fit and can thus be neglected. 

For the calculation of the background likelihood in the Matrix Element measurements in 
the £+jets channel, only processes without b quarks in the final state are considered, and the 
overall factor n.j=iW& (^jetj> ^parton k) can a g & i n De considered identical to a good approxi- 
mation for all jet-parton assignments, such that it only depends on the number of 6-tagged 
jets in the event. 

To proceed, one can divide the event sample into events with zero, one, and two b tags 
and determine the relative normalization of the signal and background likelihoods separately 
for the three subsamples. This approach was chosen in the DO analysis and is described in 
the following. 
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In contrast to the approximation made by CDF that 6-tagged jets always correspond 
to b quarks, the DO £+jets analysis considers all possible jet-parton assignments even if b- 
tagged jets are present. For the ti likelihood, the factor Wb (-Bjetj, 0partonfc) i s taken from 
Equation (|29i) with the only modification that it is approximated to 1.0 for all jets if none 
of the four jets is 6-tagged. Nevertheless, assumptions on the jet flavors are introduced for 
the calculation of Wb (^jetj> 0partonfc) sucn that the likelihoods for ti events with W decays 
to ud! and cs' final states need not be calculated separately, allowing for a reduction of the 
computation time. If an event contains exactly one 6-tagged jet, the quarks from the hadronic 
W decay are both assumed to be light quarks (u, d, or s). This is justified since the tagging 
efficiencies for b jets are much larger than those for other flavors, and there are two b jets per 
event. For events with two or more 6-tagged jets, a charm jet from the hadronic W decay is 
tagged in a non-negligible fraction of cases. Consequently, the quarks from the hadronic W 
decay are assumed to be charm quarks if the corresponding jet has been tagged, and light 
quarks otherwise. 

The improvement from the inclusion of jet-parton assignments with a tagged charm jet in 
the likelihood calculation can be seen by comparing the signal and background likelihoods. 
Figure [251(a) shows the ratio of ti to background likelihoods in simulated £+jets it events re- 
constructed at DO with two 6-tagged jets when only the two jet-parton assignments in which 
tagged jets are assigned to b quarks are considered in the signal likelihood calculation. The 
hatched histogram shows the correct assignments only, whereas the open histogram shows 
all combinations, including the ones in which a charm quark from the W decay was tagged. 
Figure 125T b) shows the same ratio when all combinations are included with their appropriate 
weights as discussed above. The tail for low signal to background likelihood ratios in Fig- 
ure l2B7a) arises because the correct jet-parton assignment is not included in the calculation 
in events where one of the tagged jets comes from a charm quark. 

The different flavor contributions to the W^+jets process are parametrized by the W+jets 
matrix element without heavy flavor quarks in the final state, which means that the factors 
Wb (£>jet j, 0partonfc) f° r the background likelihood are all equal for a given event even if 6-tagged 
jets are present. Therefore, the factors are omitted altogether from the background likelihood 
calculation. To account for the different amount of background in the event categories with 
zero, one, and two (or more) b tags, the relative normalization of ti and background likelihoods 
in the three samples is adjusted accordingly, as suggested by Equation (JT7|) . 

Transfer Function for the Recoil Energy: The transfer function for the recoil energy 
is so far only included in the CDF measurement in the dilepton channel [H]. In the £+jets 
analyses, this term is omitted, which leads to a slightly increased expected statistical mea- 
surement uncertainty but avoids any explicit dependence on the modeling of the unclustered 
transverse momentum in the simulation. 

8.4.3 Parametrization of the Jet Energy Resolution 

The jet energy transfer function, Wj et (-Ejety, ^partonfcj JES), yields the probability density 
for a measurement E^ in the detector if the true quark energy is E^ tonk , given an overall 
jet energy scale JES. Both CDF and DO describe it as a function of the difference AE = 
£jetj — -^partem fc an d the assumed parton energy E^ tonk (the Dynamical Likelihood analysis 
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Figure 28: Monte Carlo study of the effect of charm-jet tagging on the signal to background 
likelihood ratio in the DO Matrix Element analysis in the £+jets channel I3$/ . for ti events 
generated with mt = 175 GeV reconstructed with the DO detector that contain two b -tagged jets. 
The L t i values are calculated for the assumption m t = 175 GeV. (a) Only the two jet-parton 
assignments in which tagged jets are assigned to b quarks are considered, (b) All weighted 
jet parton- assignments enter the likelihood calculation. In both plots, the hatched histogram 
corresponds to those cases where the two b-tagged jets are correctly assigned to b quarks, which 
happens 84% of the time in the double-tag sample. 



uses the transverse energy instead of the energy). Different transfer functions are determined 
for jets from light quarks and gluons and for 6-quark jets, and DO also treats b jets with 
a soft muon from semimuonic heavy hadron decay separately from other b jets. In the DO 
Matrix Element analysis and the CDF measurement with the Dynamical Likelihood technique, 
different sets of transfer functions are also derived for different \n\ regions. 

In their Matrix Element analyses, both CDF and DO use a double Gaussian as a function 
of AE with parameters that depend linearly on the assumed quark energy to describe the jet 
energy transfer function^- For the case JES — 1, it is parametrized as 

W }et (E^,E^ tonk ;JES=l) = 7 * ( 32 ) 

V27r(p 2 + PsPs) 

r / (AE- Pl y \ , / (AE- Pi y \] 

M — 2^j + ^ e n — si— J. • 

The parameters pi are themselves functions of the quark energy, and are parametrized as 
linear functions of the quark energy so that 

Pi = di + hE^ tonk , (33) 



In the Dynamical Likelihood analysis, CDF does not use a parametrization of the transfer function, but 
uses random numbers generated according to the distributions. 



66 



8 THE MATRIX ELEMENT MEASUREMENT METHOD 



with CL3 fixed to in the DO analysis. The parameters and 6j are determined in a fit 
from simulated ti events, after all jet energy corrections have been applied. The DO transfer 
function for light quarks in the region < 0.5 is shown in Figure 1291 
For JES 7^ 1, the jet transfer function is modified as follows: 

rp rcc 

W- ( jct j F ass ■ 

tt/ /zrirec j-iass j 771 o\ J e H JES ' partonfc' 7 / q .n 
^jetl-^jetj^partonfc! Jhb ) = j^jn > l d4 J 

where the factor JES in the denominator ensures the correct normalization 

/ w jet{E™ t ),E^ tonk ] JES)dE™ t C j = I . (35) 
Je. t i c . 




Figure 29: Jet energy transfer functions for light- quark jets in the DO detector in the region 
\r)\ < 0.5, for parton energies E p = 30 GeV (solid), 60 GeV (dashed), and90 GeV (dash-dotted 
curve). The parametrization corresponds to the reference jet energy scale, JES = 1.0 I3$tf . 



8.4.4 Parametrization of the Muon Momentum Resolution 

Only the DO experiment so far considers the muon resolution explicitly in the likelihood 
calculation. To describe the resolution of the central tracking chamber, the resolution of 
the charge divided by the transverse momentum of a particle is considered as a function of 
pseudorapidity. The muon transfer function is parametrized as 

w, ( h/*rr , (<M7 ) = ^ex P f-i ( toM7-toM7 ) *, , (36) 




2ira 

where q denotes the charge and pt the transverse momentum assumed (ass) or reconstructed 
(rec) for a muon. The resolution 

do for \q\ < 770 

a = { t (37) 



+ - Vo)] 2 for |r/| > ^ 



8.5 The Signal Likelihood L t i 



67 



is obtained from muon tracks in simulated events. The muon charge is not used in the 
calculation of L t j and Lbk g ; however, for muons with large transverse momentum the possibility 
of reconstruction of a track bent in the wrong direction is automatically taken into account 
in the transfer function when using this parametrization. 



8.4.5 The Transfer Function for the Unclustered Transverse Momentum 

The probability density to observe unclustered transverse momentum in the event is only 
used in the CDF dilepton analysis; it is parametrized as a Gaussian in each of the x and y 
directions, with no correlation. 



8.5 The Signal Likelihood L t i 

When spin correlations between the top and antitop quarks are neglected, the leading-order 
matrix element for the process qq — > ti is given by 



>qq-^ttt | 



91 
9 



FF (2 - f3 2 s 2 qt ) 



(38) 



where g 2 /(4n) = a s is the strong coupling constant, (3 is the velocity of the top quarks in 
the ti rest frame, and s q t denotes the sine of the angle between the incoming parton and the 
outgoing top quark in the ti rest frame. If the top quark decay products include a leptonically 
decaying W boson, while the antitop decay includes a hadronically decaying W, one has 
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+ {m w Y 



w , 



(39) 



(40) 



(for the reverse case in £+jets events, replace h -h- b, £ -H- d, and v <H- u; for dilepton events, 
replace d and u by the second charged lepton and neutrino, respectively). Here, g w denotes 
the weak charge (Gf/\/2 = g^/Sm^), m t and m w are the masses of the top quark (which 
is to be measured) and the W boson, and T t and IV are their widths. Invariant top and W 
masses in a particular event are denoted by m xyz and m yz , respectively, where x, y, and z are 
the decay products. The cosine of the angle between particles x and y in the W rest frame is 
denoted by c xy . Here and in the following, the symbols d and u stand for all possible decay 
products in a hadronic W decay. The top quark width is given as a function of the top quark 
mass as [3] 



1 - 



rn 



mi 



1 + 2- 



m 



1 - 



2a, /2tt 



3tt 



(41) 



The correct association of reconstructed jets with the final-state quarks in Equations fl39j) 
and (T4"0"j) is not known. Therefore, the transfer function takes into account all possible jet- 
parton assignments as described in Section I8~4"l However, in the case of the signal likelihood for 
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£+jets events, the mean value of the two assignments with the 4-momenta of the quarks from 
the hadronic W decay interchanged may be computed explicitly by using the symmetrized 
formula 



- q 4 
F = — 



(m^ - m 2 w f + (m w rV) 2 



(42) 



instead of (HOI) , where only the terms containing are affected. Consequently, only a sum- 
mation over half the jet-quark assignments remains to be evaluated. 
The leading-order matrix element for the process gg — > ti is [57] 



'gg-nt 
with 



'-^(^-sX^+'-i^)' (43) 



raJiM* - m L , 4m 2 

2 aIld P — 2 



and p = 2 , (44) 

blvbdu blvbdu 

where i = 1,2 denotes the two incoming gluons. Here, again ti spin correlations have been 
neglected. This process is only taken into account explicitly in the CDF measurement based on 
the Dynamical Likelihood method. In the Matrix Element measurements it is not computed 
because the top and W propagator and decay parts of the matrix element, which contain most 
of the information on the top quark mass and the separation of signal and background events, 
are identical. 

The computation of the signal likelihood L t i involves an integral over the momenta of 
the colliding partons and over 6-body phase space to cover all possible partonic final states, 
cf. Equation (I24p . The number of dimensions of the integration is reduced by the following 
conditions: 

• The transverse momentum of the colliding partons is assumed to be zero, or to be con- 
sistent with the observed unclustered transverse energy (in the Dynamical Likelihood 
measurement). The transverse momentum of the ti system then follows from conser- 
vation of 4-momentum because the leading-order matrix element is used to describe ti 
production. Also, the z momentum and energy of the ti system are known from the 
momenta of the colliding partons. 

• The directions of the quarks and the charged lepton in the final state are assumed to be 
exactly measured. 

• The energy of electrons from W decay is assumed to be perfectly measured. The corre- 
sponding statement is not necessarily true for high momentum muons, and an integration 
over the muon momentum is performed in the DO analysis. 

Even after these considerations, a multi-dimensional integral remains to be calculated. In 
the Matrix Element analyses, this calculation is performed numerically with the Monte Carlo 
program VEGAS [TjB EUJ ■ 
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8.6 The Background Likelihood 

There are in general many background processes that can lead to an observed event. It is not 
problematic per se to not fully account for all backgrounds in the event likelihood; in fact, the 
Dynamical Likelihood measurements by CDF omit any explicit treatment of background in 
the likelihood. Because of the assumptions made in the Matrix Element technique, it is always 
necessary to calibrate the measurement technique with pseudo-experiments with varying input 
top quark masses, jet energy scales, and sample compositions as described in Section [10. 21 An 
incomplete background likelihood will lead to a shift of the measured top quark mass value; 
this shift will in general depend on the top quark mass itself and on the fraction of events 
in the sample that are not accounted for in the overall likelihood. The shift is determined in 
the calibration procedure. When a background term is omitted in the event likelihood, the 
situation will thus be quantitatively, but not qualitatively different from that in an analysis 
that includes this term in the likelihood. 

If several different background processes have similar kinematic characteristics, it is also 
possible to approximately describe the total background by the likelihood for only one of the 
background processes, multiplied by the total background fraction, cf. Equation ffTHj) . This 
technique has been applied by both CDF and DO in the Matrix Element analyses in the £+jets 
channel, where a likelihood for QCD multijet production is not explicitly calculated. While 
this is a better approximation than not accounting for multijet background at all, it still has 
to be studied with pseudo-experiments and taken into account in the calibration. It should be 
noted that independently of the definition of the background likelihood used, any uncertainty 
in the characteristics of a background process has to be evaluated with pseudo-experiments 
and accounted for by a systematic error on the final measurement value, see Section 111.1.81 

Even if only leading-order background processes and only the most important among them 
are considered, it is not practical to explicitly evaluate all individual diagrams. Instead, rou- 
tines from existing Monte Carlo generators are used to compute the likelihood for generic 
processes. They take into account the relative importance of the various subprocesses that 
contribute and perform a statistical sampling of all possible spin, flavor, and color configu- 
rations. Because the background likelihood does not depend on the top quark mass, it does 
not have to be computed for as many different assumptions as the signal likelihood and it is 
possible to evaluate the matrix elements without a dedicated routine optimized for speed. 

The generic background process taken into account by both CDF and DO for the Matrix 
Element analyses in the £+jets channel is the production of a leptonically decaying W boson 
in association with four additional light partons, iy+4p. Events with a leptonically decaying 
W boson and four partons that include heavy-flavor quarks are not considered separately 
because their kinematic characteristics are very similar to those of IV-Klp events. QCD multijet 
production, the second-largest background source, is not taken into account explicitly in the 
event likelihood. 

The modeling of the Vr + 4p process in the vecbos [51] generator is used to calculate 
the background likelihood Lbkg- The jet directions and the charged lepton are taken as well- 
measured, also for muons in the DO analysis. The integral over the quark energies in Equa- 
tion f )22|) is performed by generating Monte Carlo events with parton energies distributed 
according to the jet transfer function. In these Monte Carlo events, the neutrino transverse 
momentum is given by the condition that the transverse momentum of the P^+jets system be 
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zero, while the invariant mass of the charged lepton and neutrino is assumed to be equal to the 
W mass to obtain the neutrino z momentum (both solutions are considered). The mean result 
from all 24 possible assignments of jets to quarks in the matrix element is calculated, and the 
mean over a number of Monte Carlo events is taken to be the Lbkg value. The calibration 
described in Section 110.21 supports that it is not necessary to compute Lbkg for different JES 
values; only the value L bkg (JES = 1) is used. 

The CDF Matrix Element measurement in the dilepton channel considers the following 
backgrounds explicitly in the event likelihood: 

• a leptonically decaying Z boson in association with two partons, Z/j*+2p, 

• two leptonically decaying W bosons in association with two partons, WW+2p (this 
contribution is negligible if a 6-tagged jet is required and thus only considered in the 
topological analysis), and 

• one leptonically decaying W boson in association with three partons, one of which yields 
a jet that fakes an isolated charged lepton in the detector, iy+3p. 

Routines from the alpgen [20] generator are used to perform the statistical sampling to 
average the differential cross section. In the case of the Z/7*+2p process, in which no energetic 
neutrino occurs, the assumption of zero transverse momentum of the Z/7*+2p system is 
relaxed, and an integration over all possible values of pt is performed. For the l^+3p process, 
it is assumed that the isolated lepton originating from the misidentified jet carries most of the 
jet energy (otherwise it would not appear isolated in the detector), and the jet energy transfer 
function is taken to relate it with the parton energy. 

8.7 Normalization of the Likelihood for one Process 

The likelihood for a process has to be normalized by the cross section o~ ohs for observed events 
in the detector, as described in Equation (|24|) . The cross section for observed events depends 
not only on the top quark mass (in the case of L t j), but via the jet requirements in the 
event selection also on the assumed value of the JES parameter. 

To normalize the signal likelihood in the DO Matrix Element analysis, the integral <r°^ s = 
J d<Ju(pp —> x; m t , JES)f a _ cc (x)dx has been computed as a function of m t and JES as de- 
scribed in Equation (1231 . The results are shown in Figure 1301 for e+jets and /i+jets events as 
a function of mt for various choices of the JES scale factor. 

The normalization of the background likelihoods can in principle be determined in the same 
way. The computation of the integral in Equation ( 1231) would be very computing intensive even 
though the dependence of Lbkg on the JES parameter does not have to be taken into account 
as shown in the DO analysis. The DO experiment has therefore used a different method to 
compute the relative normalization of signal and background likelihoods (an overall scale factor 
is irrelevant in the analysis), assuming the relative contributions of the individual background 
subprocesses to the total background likelihood are known. This approach makes use of the 
fact that the fitted signal fraction / t j of the sample will be underestimated if the background 
likelihood Lbkg is too large and vice versa. The relative normalization can therefore be adjusted 
until the signal fraction is determined correctly in pseudo-experiments of simulated events. 
Note that this technique can only be used when the f t i parameter is left free in the fit (i.e. 
when no constraint from the ti and background cross section is used), as is the case for the 
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Figure 30: Cross section of observed ti events in the DO detector computed with the 
leading-order matrix element for (a) e+jets and (b) ri+jets events as a function of the top 
quark mass m t for different choices of the JES scale factor: JES = 1.12 (dash-dotted), 
JES = 1.0 (solid), and JES = 0.88 (dotted lines). The branching fraction ti — > bMvqq' is 
not included, as such a constant overall scale factor is irrelevant for the analysis. 



CDF and DO Matrix Element measurements in the £+jets channel. The calibration of the f t i 
fit result is further discussed in Section 110.21 
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9 The Ideogram Measurement Method 

This section describes the Ideogram method and its application in top quark mass 
measurements. Like the Matrix Element and Dynamical Likelihood techniques dis- 
cussed in the previous section, this method is based on a per-event likelihood that 
depends on the top quark mass. The signal and background likelihoods are however 
only based on the reconstructed top quark and W boson masses in each event and 
do not make use of the full kinematic information. This means that the amount of 
computation time needed for the analysis is reduced significantly. 

The Ideogram method has previously been used to measure the W boson mass at the 
DELPHI experiment at LEP [82]. It is now also applied by the DO and CDF experiments 
to measure the top quark mass using tt events in the £+jets [83] and all-jets channels [58] . 
respectively. In Section 19.14 the event selection and reconstruction using a kinematic fit are 
summarized for these two analyses. The definition of the event likelihood in the Ideogram 
method is then discussed in Section l9~2l and compared with the approach in the Matrix Element 
and Dynamical Likelihood methods. 

9.1 Event Selection and Kinematic Reconstruction 

The event selection in the DO £+jets analysis is identical to the one used in the Matrix Element 
measurement described in Section IHTLj except that events are also used if more than four jets 
are reconstructed (only the four highest--Er jets are used to measure the top quark mass). 
There is an additional cut on the \ 2 obtained from a kinematic fit as described below. 

The kinematic requirements on the events in the CDF measurement in the all-jets channel 
are similar to those described in Section 17.31 The most important requirements are: 

• no significant missing transverse energy, 

• removal of events with a charged lepton with high p?, 

• events must contain between 6 and 8 jets within \rj\ < 2.0 with Et > 15 GeV, and 

• the sum of jet transverse energies must satisfy ^j ets E T > 280 GeV. 

There are additional event quality cuts and requirements on the event shape using the apla- 
narity and centrality. 

The events are then subjected to a kinematic fit constraining them to the tt hypothesis. 
In the DO £+jets analysis, the kinematic fit is similar to the one described in Section I7.1[ 
yielding one fitted top quark mass m\, the corresponding uncertainty a l mt and the best xf 
for each of the 12 different jet-parton assignments (an interchange of the two jets assumed to 
come from the hadronic W decay does not change the kinematic fit) and for each of the two 
possible solutions for the longitudinal neutrino momentum component p VjZ . The index i thus 
runs over 24 different possibilities. All of these values depend on the assumed value JES of 
the jet energy scale. 

The kinematic fit in the all-jets case is identical to the one discussed in Section ITTBI except 
that in this analysis, the masses of the two decaying top quarks per event are treated as 
independent fit parameters. Thus, for each of the 90 jet-parton assignments i that have to be 
distinguished in an event, the two fitted top quark masses mj' ^ 2 and their uncertainties a l mj 2 
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are determined together with the minimum %\. 111 the analysis in the all-jets channel, no in 
situ calibration of the jet energy scale is performed so far. 

9.2 The Event Likelihood 

The definition of the likelihood L cvt to observe a given selected event is identical to that used 
in the Matrix Element method, cf. Section 18.21 

Levt (x; m t , JES, fu) = fuLu {x; m t , JES) + (1 - f t t) L hkg (x; JES) , (45) 

where L t i and Lbk g are the likelihoods to observe the event if it was produced via the signal 
or any of the background processes, respectively, f t i is the overall fraction of signal events in 
the selected event sample, x denotes the event observables, and m t and JES are the assumed 
values of the top quark mass and jet energy scale which are to be measured (in the all-jets 
analysis, the parameter JES is fixed to 1.0). The evaluation of the signal and background 
likelihoods however differs from that in the Matrix Element method. 

The event observables can be classified into the kinematic information xy, n used in the 
kinematic fit to reconstruct the top quark mass and other variables x topo /b (describing the 
event topology and the 6-tagging information) that are uncorrelated with the top quark mass 
and used to improve the separation of signal and background events. The likelihood for the 
event to be produced via process % can then in general be written as the product 

L P (x; mt, JES) = L P m (x kin ; m t , JES) Lp P ° /6 (x topo/b ) , (46) 

where the dependence on m t only enters for the signal process, P = ti. The second term is 
only included in the DO analysis. It recovers some of the topological information of the event 
(like the relative angles between the decay products) that can otherwise only be used in the 
Matrix Element method, while the L P m term in the Ideogram method extracts information on 
m t only from invariant mass information obtained in the kinematic fit (which in turn is also 
insensitive to angular information). In addition, event quality and 6-tagging information can 
be included in Lp PO//ft . The kinematic and topological terms in the likelihood are discussed in 
turn in the following sections. 

9.2.1 The Kinematic Likelihood for a Process 

The kinematic part of the signal or background likelihood is calculated as a sum over all jet- 
parton assignments (and neutrino solutions, in the case of the £+jets analysis). The relative 
likelihood Wi of assignment /solution % to be correct is obtained from the minimum xf of the 
corresponding fit and from 6-tagging information as 

Wi (4 in ; mt, JES) = exp (-i ( x 2 )^ f[ W b ^ tonk ) , (47) 

where the product runs over all rij jets in the event, and Wb is given by the 6-tagging efficiencies 
for light and 6-quark jets as defined in Equation f l2§|) . The weights depend on the top quark 
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mass and jet energy scale because the results of the kinematic fit do (including the minimum 

x 2 ). 

The kinematic term in the signal likelihood L^ n describes the correct jet-parton assignment 
( "ca" ) and all other assignments ( "wa" ) separately and can be written as 

Lfi n (x kin ; m t , JES) = J^w* (x l kin ] m t , JES) [ / ca S ca (^ in ; m t , JES) 

i 

+ (1 — /ca) S'wa ( x im'i m ti JES)] , (48) 

where % runs over all 24 assignments/solutions, and / ca corresponds to the relative weight given 
to the correct assignment by the weights wv In the DO analysis in the £+jets channel, the 
value of / ca is determined from the simulation as the average fraction of weights w ca _/ (£\ wi) 
given to the correct assignment; the dependence of / ca on the total number of reconstructed 
jets and the number of 6-tagged jets is taken into account. 

For the correct assignment, the likelihood to observe the fitted top quark mass mf* takes 
into account both the natural width T t of the top quark and the experimental resolution cr^ , 
which is assumed to be Gaussian and determined on an event-by-event basis in the kinematic 
fit. The likelihood is given by their convolution 

S ca (4 in ; mt, JES) = j G (mf M , m', o%f) BW(m' } m t )dm' , (49) 

where the integration is over the true mass ml of the top quark in the given event. The 
Gaussian resolution G and the relativistic Breit-Wigner BW can be expressed as 

O (*V.40- Nf^'l - ,50) 



BW(m',m t ) = \ m f' (51) 

tt (m a - mf) + mfTf 

respectively. The likelihood is sensitive to the jet energy scale via the \ 2 obtained in the 
kinematic fit since a constraint to the known W boson mass is applied. In the £+jets analysis, 
with only one fitted mass mf*' 4 per jet-parton assignment i, m! can be interpreted as the 
average of the top and antitop quark masses. In the analysis in the all-jets channel, the term 
Sea contains one integral as given in Equation (|49p for each of the two fitted masses. 

Wrong jet-parton assignments in signal tt events cannot easily be described as a similar 
convolution. Therefore, the corresponding term S'wa (x l kin ; m t , JES) is given by the distribution 
of fitted masses mf* in simulated tt events, where the two neutrino solutions for the correct 
jet-parton assignment are excluded and all other assignments/solutions are weighted with Wi. 
Even though it describes wrong assignments, S wa still depends on the top quark mass. The 
fitted uncertainty a^ t is not used. In some simulated events, the correct jet-parton assignment 
cannot be unambiguously identified. These events are excluded when determining the shape 
of S wa ; the calibration of the measurement technique is however performed using the full 
simulation including these events, as described in general in Section 110.2} so that the final 
measurement result is unbiased. 
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In both the £+jets and all-jets channels, background is described with one likelihood L bkg . 
The kinematic term of the background likelihood is given by 

L^ k n g (x kin ; JES) = mB(x^ JES) (52) 

i 

with a weight Wi per jet-parton assignment /solution i as defined above in Equation (HTj) . 
In the £+jets analysis, B is the shape of the mass spectrum obtained in simulated W / + 4p 
events, where each assignment i enters with its weight lOj as in the likelihood. The shape B 
of the background spectrum does not depend strongly on JES (the number of background 
events does, but this is not relevant since the signal fraction is a free parameter in the 
measurement), and L^ k ° is always evaluated at JES = 1 like in the Matrix Element analyses, 
see Section IHUl 

In the all-jets channel, background is described by a mixture of bb + Ap events simulated 
with alpgen and 6p events obtained from the data. Here, B is a two-dimensional function 
of the two fitted masses. As above, it is obtained as the weighted spectrum obtained in the 
background events. No JES dependence is included in the likelihood since the JES parameter 
is not fitted. 



9.2.2 The Topological Likelihood for a Process 

In the DO analysis in the £+jets channel, a term i}° po / b j s included in the likelihood that 
captures the information from the event topology, the event quality, and the number of b- 
tagged jets in the event. Note that in the kinematic term of the signal likelihood, 6-tagging 
information is included to improve the identification of the correct jet-parton assignment in 
signal events, while it is used here to improve the separation between signal and background. 
The inputs used in the calculation are: 

• Topological information: Four variables are used. These are 

— the missing transverse energy Jfir] 

— the aplanarity A as defined in Section [731 computed from the momenta of all jets 
and the leptonically decaying W boson reconstructed in the kinematic fit; 

— the ratio H' T2 of the scalar sum of the jet transverse momenta, excluding the 
highest-pT jet, and the scalar sum of the longitudinal momenta of the jets and 
the reconstructed leptonically decaying W boson; and 

— the quantity 

„ min (AKjj) min (E T>1 , E TJ ) 

&T,mm' = p , (5dJ 

where min (ATZy) is the minimum distance between any two jets among the four 
highest-pT jets. 

Although other variables like the scalar sum of all jet transverse momenta have better 
separation power between signal and background, these variables are correlated with 
the top quark mass, which is why they are not used in the topological likelihood. The 
separation obtained with these topological variables is shown in Figures l3lT a) and (d). 

• Fraction of track pt contained in jets: Considering scalar sums of track transverse 
momenta, this variable is defined as the fraction of track contained within the recon- 
structed jets of the event (i.e. within ATI < 0.5 of the calorimeter jet axes). This variable 
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distinguishes clean events from events with poorly defined jets and is uncorrected with 
the topological information described above. It provides separation in particular between 
ti and QCD multijet background events, as can be seen in Figures [3lYb) and (e). 
• 6-tagging information: Finally, the number of 6-tagged jets in the event is used as 
an input to the likelihood. 
A likelihood discriminant D is created from all input variables. The topological/6-tagging 



terms (x topo /b) and (^topo/ft) of the likelihood are then given by the fraction of 

signal or background events at the value of D reconstructed for a particular event. These are 
shown in Figures I3~ilc) and (f). Note that while topological and 6-tagging information are 
also used in the Matrix Element analysis, the pt fraction variable is unique to the Ideogram 
analysis. 
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Figure 31: DO £+jets Ideogram measurement: Topological likelihood for e+jets (upper plots) 
and fi+jets (lower plots) events. Plots (a) and (d) show the separation between ti signal (red), 
W+jets background (yellow), and QCD multijet background (blue) when only topological in- 
formation is used. In plots (b) and (e), information from the track p T fraction contained in 
jets is included. Plots (a), (b), (d), and (e) show the expected distributions on an arbitrary 
linear vertical scale. Plots (c) and (f) show the final distributions used to compute the topolog- 
ical likelihood, which also include b-tagging information. In these plots, the expectations are 
scaled to the results from 425 pb -1 of data, which are are overlaid [83]. In all plots, the ti and 
W+jets predictions are from simulated events, while the QCD multijet distribution has been 
obtained from data using a signal depleted sample. 
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10 The Top Quark Mass Fit and its Calibration 

With the methods presented in Sections^ 0, and\Q a likelihood for a sample of se- 
lected events to be consistent with a given top quark mass hypothesis can be computed. 
This section describes how this information is used to determine the measurement 
value of the top quark mass and its (statistical) uncertainty. The calibration of the 
measurement method with simulated experiments is also discussed. 

The previous sections describe how the likelihood as a function of the top quark mass 
hypothesis to obtain the observed data sample can be determined: via the comparison of the 
estimator distributions in data and simulation (template method, cf. Section [7]), or using like- 
lihoods calculated for each individual event with the Matrix Element (Section [S]) or Ideogram 
methods (Section [H]). Section riO.il describes the step of obtaining a (raw) measurement value 
of the top quark mass and its statistical uncertainty from this information. 

This (raw) measurement value is only correct if the assumptions made to derive it reflect 
reality. Uncertainties on these assumptions will translate into systematic uncertainties on 
the measurement, as described in Section [TTJ On the other hand, known deficiencies or 
approximations in the technique used to determine the likelihoods can be corrected for by 
calibrating the measurement with fully simulated events. This step also allows for a test of the 
uncertainties obtained in the fitting procedure and a comparison of the measured uncertainty 
in data with expectations. It is further described in Section 110.21 

10.1 The Fitting Procedure 

The technical details of how a (raw) measurement of the top quark mass is extracted from 
the likelihood information varies between the individual analyses. For example, the procedure 
depends on whether the likelihood is known for arbitrary top quark masses or only for a 
discrete set of values. Furthermore, some analyses require the simultaneous measurement of 
the top quark mass and jet energy scale. 

To cover the techniques applied, the fitting procedures used in the CDF template measure- 
ment in the £+jets channel [381 E] an d in the DO Matrix Element measurement in the £+jets 
channel [39] are described as examples in Sections 110. 1.11 and I10.1.2[ respectively. The fitting 
procedure does not depend a priori on the tt event topology. 

10.1.1 Fitting Procedure in the CDF Lepton+Jets Template Analysis 

This section describes the fit used in the CDF template analysis in the £+jets channel to 
determine the top quark mass mt and the jet energy scale JES (as well as the signal fraction). 

The event selection, estimators, and template parametrizations are described in Section ITTTl 
and are briefly recapitulated here: 

• The selected events are grouped into four categories with different expected signal to 
background ratio depending on the number and transverse energies of 6-tagged jets in 
the event. 

• The top quark mass m\ eco obtained in a kinematic fit of each measured event to the 
ti hypothesis is used as estimator for the top quark mass; the dijet mass m,-j is taken 
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as estimator for the jet energy scale whose deviation Ajes from the standard scale is 
measured in units of its uncertainty a c . 

• For signal tt events, the m r t eco templates are parametrized as functions of rrf t eco as well 
as of the true top quark mass m t ; similarly the nx^- templates are parametrized as 
functions of mjj and the parameter Ajes to be measured. The background templates 
are parametrized as functions of m\ eco and va.jj, too. 

The m r t eco and rrijj values in the data events are compared to the signal and background 
templates in an unbinned likelihood fit, which determines the top quark mass, jet energy scale, 
and the number of signal and background events in each of the four event categories. The 
likelihood for one event category is computed as the product of likelihoods for each data event, 
which in turn contain four terms each: 

• the likelihood to measure the reconstructed value of m r t eco , obtained from the linear 
combination of signal and background m T t eco templates for given m t and Ajes hypotheses, 
with relative contributions of signal and background also determined in the fit (mainly 
sensitive to m t ); 

• a similar term based on rrijj (mainly sensitive to Ajes); 

• a term describing the probability of having certain numbers of signal and background 
events in the data, given the total number of selected events; and 

• a constraint on the expected number of background events (not for the 0-tag event 
category) . 

The likelihoods for all four event categories are then multiplied, and a constraint to the a 
priori knowledge of the jet energy scale is included as another overall factor. Since the m t and 
Ajes parameters are the same in all event categories, a total of ten parameters are determined 
in the fit. These parameters are determined simultaneously using minuit [81]. 

10.1.2 Fitting Procedure in the DO Lepton+Jets Matrix Element Analysis 

In an analysis with parametrized templates, it is possible to let the minimization program 
(e.g. minuit) decide for which assumed parameter values to evaluate the overall likelihood. 
This is impractical for the Matrix Element and Ideogram methods, where the calculation of 
the overall likelihood for one hypothesis is a time-consuming process. In these analyses, a 
different approach is therefore followed: 

• In a first step, the overall likelihood is calculated for each hypothesis in a grid of assumed 
parameter values. 

• Second, the dependency of the likelihood on the parameters that are to be measured is 
fitted with a function. 

• The minimum of this function yields the central measurement value, and the statistical 
uncertainty is given by the 68% confidence region around this central value. 

As an example, the fitting procedure used in the DO Matrix Element measurement in the 
£+jets channel is described here. 

Also in this analysis a simultaneous measurement of the top quark mass mt, jet energy 
scale JES, and signal fraction f t l is performed. For each selected event, the signal likelihood 
is evaluated for a grid of assumed m t and JES values in steps of 2.5 GeV and 0.01. The 
background likelihood is calculated for JES = 1 only and is assumed not to depend on the 
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JES parameter value. 

For any given (m t , JES) assumption, the likelihood as a function of fti can then be calcu- 
lated easily as the linear combination given in Equation ( TT8|) . The signal fraction f^ est that 
maximizes the overall likelihood is calculated for each (m t , JES) parameter pair, and the like- 
lihood value corresponding to this value is used in further computations. The overall result 
quoted for the fitted signal fraction fa is derived from the value obtained at the (m t ,JES) 
point in the grid with the maximum likelihood value for the event sample. The uncertainty on 
fti is computed by varying f t i at fixed m t and JES until A(— InL) = +\. This uncertainty 
does not account for correlations between m t , and JES. 

The result for the top quark mass is obtained from a projection of the two-dimensional grid 
of likelihood values onto the m t axis. In this projection, correlations are taken into account. 
The likelihood for a given m t hypothesis is obtained as the integral over the likelihood as a 
function of JES, using linear interpolation between the grid points and Gaussian extrapolation 
to account for the tails for JES values outside the range considered in the grid. 

The likelihoods as a function of assumed top quark mass are converted to — In L values. 
These — In L points are then fitted with a fourth order polynomial in the region defined by 
the condition AlnL < 3 around the best value. The m t value that maximizes the fitted 
likelihood is taken to be the measured value of the top quark mass. The lower and upper 
uncertainties on the top quark mass are defined such that 68% of the total likelihood integral 
is enclosed by the corresponding top quark mass values, with equal likelihood values at both 
limits of the 68% confidence level region. The same projection and fitting procedure is applied 
to determine the value of the JES parameter. 

The inclusion of 6-tagging information introduces two significant improvements to the 
analysis: Both the separation between signal and background and the identification of the 
correct jet-parton assignment (under the signal hypothesis) are improved. Since the signal and 
background likelihoods are evaluated on an event-by-event basis and the 6-tagging information 
is encoded in the transfer function (cf. Section 18"^]) . both aspects are in principle addressed, 
and it should not be necessary to divide the event sample into subsamples of different purity 
like in the template analysis described in Section 110.1 .11 Nevertheless, the DO experiment 
has taken a different approach. The transfer function W(x,y; JES) given in Equation f[28]) 
is modified to obtain 



This new transfer function is used in the computation of the signal likelihood so that the L t i 
values can be compared with those of the background likelihood, which in turn are computed 
without taking 6-tagging information into account at all, i.e. with the transfer function given 
in Equation (127|) . 

This method implies that only the identification of the correct jet-parton assignment in 
ti events is improved. To recover the enhanced separation of signal and background events, 
the event sample is subdivided into three categories based on the number of 6-tagged jets per 
event. Overall values of the top quark mass m t , jet energy scale JES, and signal fraction 
fti are determined for all three categories together by relating the sample composition in 



W(x,y; JES) 



W(x,y; JES) 



(54) 




n 



i=l j=l 
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each category to the overall signal fraction. It should be possible in future updates of the 
measurement to use the full transfer function for the background likelihood and thus avoid 
fitting different subsamples of events. 

10.2 Validation and Calibration of the Measurement 

If the model used to describe the data is correct, then the measurement method should 
yield unbiased results and the correct statistical uncertainty. To validate the measurement 
technique, this assumption can be verified with simulated pseudo-experiments using events 
that have been generated with this model. 

However, most analysis techniques involve some simplifications, for example via the tem- 
plate parametrization or the simplified treatment of detector resolution and physics processes 
in the Matrix Element and Ideogram methods. Given these simplifications, it cannot be as- 
sumed that every aspect of the data is accounted for. To calibrate the measurement, it is first 
essential that the agreement between data and the full simulation is verified. Monte Carlo 
events generated with the full simulation are then used to compose pseudo-experiments for 
the calibration. 

The following information is obtained from pseudo-experiments: 

• The relation between the expected (mean) raw measurement value (m r t aw ) and the true 
input value mt- Because the mass range of interest is limited a priori to a range around 
a value mf, it is usually parametrized as a linear function in mt as 



The symbols s and o stand for the slope of the calibration curve and for the offset at 



• The width w of the pull distribution. To test that the fitted uncertainties describe the 
actual measurement uncertainty, the deviation of the measurement value from the true 
value is divided by the fitted measurement uncertainty in each pseudo-experiment. The 
width of this distribution of deviations normalized by the measurement uncertainty is 
referred to as pull width. 

• The expected distribution of measurement uncertainties. 

This information can be determined accordingly for any other parameter that is measured 
(JES and f t t, if applicable). In the validation step, values of s = 1, o = 0, and w = 1 
are expected. Because of simplifications in the measurement technique, this is in general not 
true for the calibration based on the full simulation. The values of s and o obtained in the 
calibration are used to correct the raw measurement value, and the measurement uncertainty 
is adjusted according to the value of w. As an example, the results from the validation and 
calibration of the DO measurement with the Matrix Element method in the £+jets channel 
are described in the following paragraphs. 

Validation: To validate the Matrix Element method, the DO collaboration has generated 
events with leading-order event generators (madgraph [85J for ti events and alpgen for 
ly+jets events), i.e. not including initial- or final-state radiation, for various values of the top 
quark mass and jet energy scale. These events have been smeared according to the transfer 
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function described in Section [5^1 The events are required to pass a simplified kinematic selec- 
tion similar to the actual event selection, and the normalization of the likelihood is determined 
for this selection according to Equation (123]) . 

Pseudo-experiments are composed of these events with the number of signal and back- 
ground events as observed in the data, and the measurement values m t and JES are deter- 
mined for each pseudo-experiment. In a test where 6-tagging information is not used (jets 
are assumed not to be 6-tagged), the fitted top quark mass and jet energy scale are unbiased 
within statistical uncertainties of the test of 300 MeV and 0.003, respectively. Furthermore, 
the fitted m t value does not depend on the input JES value used in the generation of the 
pseudo-experiments, and similarly, the fitted JES value is independent of the true input top 
quark mass. The pull width is in agreement with 1.0. This validation study and its results 
are described in detail in [86] . 

Calibration: For the calibration, fully simulated tt and IV+jets events are used to compose 
pseudo-experiments with the same numbers of events as measured in the datfl As an example, 
the calibration curves for the top quark mass and jet energy scale in the DO Matrix Element 
measurement (including 6-tagging information) are shown in Figures [321 arid [331 The deviation 
between input and fitted jet energy scale arises from the simplified description of the detector 
response. Given this offset, the (anti-)correlation between the top quark mass and jet energy 
scale measurements explains the observed shift between true and fitted mt. The widths of the 
pull distributions are slightly larger than one. 

For each pseudo-experiment, the statistical uncertainty on the top quark mass is multi- 
plied by the pull width, and the resulting distribution of statistical uncertainties is shown in 
Figure [3H This allows a comparison with the statistical uncertainty obtained in the data, 
which is also shown in the figure. 

The interpretation of such a comparison is not as straightforward as it may seem: The 
pseudo-experiments have always been composed with the same expected numbers of signal 
and background events. These numbers have been obtained from the data using a topological 
likelihood fit independent of the Matrix Element method; it yields the tt fraction of the sample 
with an (absolute) error of 7% for 0.4 fb _1 of data (the f t i result from the Matrix Element 
method itself has a similar uncertainty). If this uncertainty is also taken into account, what 
at first glance appears to be a discrepancy between the fitted top quark mass and jet energy 
scale uncertainties in the data and the expectation becomes much more consistent. This is 
shown in Figure [351 where the expected uncertainties for a combined m t and JES fit using 
topological information only (i.e., no 6-tagging information) are shown for two types of pseudo- 
experiments: Experiments with a sample composition according to the central measured value; 
and experiments with f t i varied down by one standard deviation. Much better agreement 
between predicted and observed uncertainties is obtained with the latter class of pseudo- 
experiments. 

The fti calibration curve for the CDF Matrix Element analysis in the £+jets channel is 
shown in Figure [361 The raw value flf w is smaller than the true value. This is due to the fact 

8 In the pseudo-experiments for their Matrix Element analysis, the DO experiment chooses to describe the 
multijet background with additional W^+jets events because the kinematic characteristics are similar. This 
simplification is accounted for with a systematic uncertainty. 
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Figure 32: Calibration of the fitting procedure in the DO Matrix Element analysis in the £+jets 
channel [39]. The upper plots show the reconstructed top quark mass (a) and the measured 
jet energy scale (b) as a function of the input top quark mass. The two lower plots show the 
reconstructed top quark mass (c) and the measured jet energy scale (d) as a function of the 
input jet energy scale. The solid lines show the results of linear fits to the points, which are 
used to calibrate the measurement technique. The dashed lines would be obtained for equal 
fitted and true values of m t and JES. 
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Figure 33: Calibration of the fitting procedure in the DO Matrix Element analysis in the i+jets 
channel HHty . The upper plots show the widths of the pull distributions for the top quark mass 
(a) and jet energy scale (b) as a function of the input top quark mass. The two lower plots 
show the widths of the pull distributions for the top quark mass (c) and jet energy scale (d) as 
a function of the input jet energy scale. The solid lines show the mean pull width, while the 
dashed lines indicate a pull width of 1.0. 



84 



10 THE TOP QUARK MASS FIT AND ITS CALIBRATION 



-£ 150 

.Q 

E 

CD 
W 

c 

LU 



D0 Run II, 0.4 fb 

(a) 



100 



CD 

E 

3 



-1 




CO 

E 

a> 
in 
c 

LU 



a 
.a 

E 

3 



200 



150 



100 




0.02 0.04 



o(m t ) (GeV) 



0.06 0.08 
o(JES) 



Figure 34: Test of the uncertainties on (a) m t and (b) JES obtained in the DO Matrix Ele- 
ment analysis P3$j . The distributions of fitted uncertainties obtained from pseudo-experiments 
are shown by the histograms. The histograms show the combined distributions of upper and 
lower uncertainties as the individual distributions are very similar. The upper and lower 
uncertainties observed in the data are indicated by the solid and dashed arrows, respectively. 
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Figure 35: DO Matrix Element analysis in the i+jets channel: Effect of the tt fraction f t i on 
the expected fit uncertainties. The uncertainties on (a) m t and (b) JES obtained by DO in 
the topological Matrix Element analysis when a sample composition according to the central 
fti value is assumed is shown by the solid histogram F3l% . Pseudo-experiments with f t i varied 
down by one standard deviation yield the distributions of uncertainties shown by the dash- 
dotted histogram. The upper and lower uncertainties observed in the data are indicated by the 
solid and dashed arrows, respectively. 
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Figure 36: Calibration of the f t i determination in the CDF Matrix Element analysis in the 
£+jets channel IffhlJ . The points with error bars show the raw fitted f t i value for various true 
tt fractions in the pseudo-experiments. The linear parametrization of these points is shown, 
and the values of the slope and offset (at f t i = 0) are indicated in the inset. To guide the eye, 
the dashed line shows the line f^ aw = f t i- 



that a leading-order matrix element is used to describe the tt process, while higher-order effects 
are included in the full simulation: In the simulation, about 20 — 30 % of tt events have jets 
and partons that cannot be unambiguously matched, i.e. at least one of the four reconstructed 
jets cannot be assigned to a parton from the tt decay. These events yield poor top quark mass 
information and degrade the uncertainty estimate of the likelihood fit. Figure 1371 shows a DO 
study which illustrates that jet-parton matched tt events tend to have a higher signal than 
background likelihood, which is how the mass fit identifies them as signal-like. There is no 
such separation for signal events in which one or more jets cannot be matched to a parton, so 
that these events contribute much less mass information to the final likelihood. 



10.3 Fit Results 



In this section, the fit results of the CDF template [SZ] and DO Matrix Element [325] analyses 



in the £+jets channel are described. These measurements have been chosen in order to give 
one example for each of the two fitting techniques. 

The reconstructed m r t eco and rrijj estimator distributions in data are shown in Figures I3"51 
and [33 respectively, for the CDF template measurement. The parametrized template distri- 
butions corresponding to the fitted parameters are overlaid. 

In the template measurement, the combined fit to the estimator distributions yields the 
likelihood as a function of assumed m t and JES values. In the Matrix Element technique, 
this information is determined from the individual event likelihoods. These results, including 
the statistical uncertainties, are visualized in Figure HO] for the two measurements. Contours 
are shown corresponding to AlnL = 0.5, 2.0, 4.5, and 8.0 relative to the minimum — InL 
value, where L denotes the likelihood for the event sample. The calibrations for mt and JES 
derived as discussed in the previous section are taken into account. 

The results quoted by the DO collaboration are obtained from the projection of the likeli- 
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Figure 37: DO Matrix Element analysis in the £+jets channel J33/: Distributions of 
log 10 (L t f/Lbk g ) for tt events with m t = 175 GeV (red and orange areas) and W+jets events 
(dark blue lines) for (a) e+jets events and (b) fi+jets events. The L t i values are calculated 
for the assumption m t = 175 GeV. The distributions for signal and background events are 
normalized individually. Those tt events where all jets can be matched to partons are shown 
in red, while tt events that fail this requirement give rise to the orange distributions. 
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Figure 38: CDF lepton+jets template measurement [67^ : Data m\ eco distributions in the (a) 2- 
tag, (b) l-tag(T), (c) l-tag(L), and (d) 0-tag event categories, together with the parametrized 
template distributions for signal+background and background only that correspond to the fitted 
parameters. 
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Figure 39: CDF lepton+jets template measurement [67\l : Data rrijj distributions in the (a) 2- 
tag, (b) l-tag(T), (c) l-tag(L), and (d) 0-tag event categories, together with the parametrized 
template distributions for signal+background and background only that correspond to the fitted 
parameters. 
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Figure 40: Results of the fits to determine the top quark mass and jet energy scale, (a) CDF 
template measurement in the £+jets channel [67\/ . (b) DO Matrix Element measurement in the 
£+jets channel JMy . In both cases, the contours corresponding to Ala L = 0.5, 2.0, 4.5, and 
8.0 relative to the minimum are shown. Note that in (a) the jet energy scale (vertical axis) 
is measured in units of the uncertainty a c of the external calibration, while in (b ) (horizontal 
axis) the multiplicative scale factor for jet energies is given. 
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hood onto the m t and JES axes as described in Section 110.1.21 These projections are shown 
in Figure HU together with the fitted curves. The central values and 68% confidence level 
intervals are also indicated. 




Figure 41: One- dimensional projections of the likelihood obtained in the DO Matrix Element 
measurement in the £+jets channel shown in Figure \4ty(b ) 13$/ . Plot (a) shows the likelihood 
as a function of assumed top quark mass. The correlation with the jet energy scale is taken 
into account. The fitted curve is shown, as well as the most likely value and the 68% confidence 
level region. The corresponding plot for the JES parameter is shown in (b). 



The comparison of the fitted uncertainties with the expectation from pseudo-experiments 
is discussed in the previous section. The statistical uncertainty includes the uncertainty from 
the absolute jet energy scale. The contribution of the absolute jet energy scale to the total 
statistical uncertainty can be estimated by repeating the fit with the JES parameter fixed. It 
should however be noted that fitting for one overall factor does not cover the entire systematic 
uncertainty due to the jet energy scale, cf. Section [11.2.11 
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11 Systematic Uncertainties 

The previous section described how the central measurement value of the top quark 
mass and the associated statistical uncertainty are determined. To date, the world- 
average value for the top quark mass is already systematically limited. This sec- 
tion discusses the individual sources of systematic errors, describes the correlations 
among various measurements and how they are handled in the combination, and 
indicates where systematic uncertainties may be reduced in the future. 

With the increasing size of the datasets collected at Run II of the Tevatron, the precision 
of the world-average value of the top quark mass has already become limited by systematic 
uncertainties. This is in spite of the fact that the measurement techniques have been improved 
during the past years. In particular, the determination of the jet energy scale from the same 
data that is used to measure the top quark mass has reduced the systematic uncertainty due 
to the detector calibration. Thus, the initial expectations for Run II of the Tevatron have 
already been surpassed. At the LHC, systematic effects will become even more dominant. 

In this section, the different sources of systematic uncertainties are discussed together 
with the way they are typically evaluated. Systematic correlations between measurements or 
experiments, which will tend to reduce the beneficial effect of combining several measurements, 
are mentioned. Also indicated are ideas for future improvements, as well as limitations. 

Systematic uncertainties can be broadly classified into three categories: modeling of the 
physics processes for ti production and background, modeling of the detector performance, 
and uncertainties in the measurement methods. The following discussion is ordered along the 
lines of this classification. 

In Table S] an overview of systematic uncertainties is given, quoting both the uncertainties 
on the world-average top quark mass [9] (which is only available in broad categories) and on 
one individual measurement [39] . 

11.1 Physics Modeling 

Many different processes can lead to the ti event candidates selected for a top quark mass 
measurement, and not all of them can be taken into account in the simulation. In addition, 
the description of the processes that are accounted for may still be subject to uncertainties. 
This type of uncertainties is discussed in this section, while effects not arising from a single 
hard interaction (multiple interactions) are treated in Section 111.21 

The top quark decay properties are well-known in the Standard Model, including the sub- 
sequent decay of the W boson into partons, since these decays are governed by the weak 
interaction and the top quark does not hadronize. The top quark width as a function of its 
mass is known [3], and the branching fraction of the decay t — > Wb is 100% for practical pur- 
poses. Furthermore, the mass, width, and branching fractions of the W are known precisely [5] 
and the associated uncertainties can be neglected. 

In contrast, significant uncertainties do arise from the production of the ti pair (modeling 
of the parton distribution functions and of initial-state radiation) and the formation of final- 
state jets (final-state radiation, fragmentation, and hadronization modeling). Usually, the 
Monte Carlo simulation of ti events is based on the leading-order matrix element for the 
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Source of Uncertainty 



World DO, Lepton+Jets 
Average Channel 



Statistical uncertainty ±1.2 ±2.5 

Physics modeling: ±1.0 



PDF uncertainty 




±0.16 -0.39 


ISR/FSR modeling 




±0.46 


b fragmentation 




±0.56 


b/c semileptonic decays 




±0.05 


W±jets background modeling 




±0.40 


QCD contamination 




±0.29 


Detector modeling: 


±1.4 




Absolute jet energy scale 




±3.2 -3.7 


JES px dependence 




±0.19 


b response (h/e) 




±0.63 -1.43 


Trigger 




±0.08 -0.13 


b tagging 




±0.24 


Noise, multiple interactions 






Method: 


±0.3 




Signal fraction 




±0.15 


MC calibration 




±0.48 


Total uncertainty 


±2.1 


±4.3 -4.9 



Table 4: Summary of uncertainties on the top quark mass. All values are quoted in GeV. 
Uncertainties on the world- average value are quoted in the second column. Only values 
corresponding to a broad classification of error sources are available for the world average. 
Some values from JMj have been combined to reflect the categories used here. The detector 
modeling uncertainty is dominated by that on the absolute value JES of the jet energy scale. 
The right column shows uncertainties for the DO Matrix Element measurement in the £+jets 
channel [39]. For asymmetric uncertainties the upper and lower errors are quoted separately. 
The uncertainty from the absolute JES value has been listed together with the systematic 
uncertainties in the right column even though it is determined with in situ calibration and 
scales with statistics. 
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processes qq — > ti and gg — > ti. However, next-to-leading-order Monte Carlo simulation is 
already available [S7]. The PDF parametrization and the modeling of initial- and final-state 
radiation have to be matched with the description of the hard-scattering process accordingly. 
The general description below remains however valid in both cases. 

11.1.1 PDF Uncertainty 

Parton distribution functions (PDFs) parametrize the probability to find a parton of a given 
flavor and momentum fraction inside the proton or antiproton, and thus the kinematic dis- 
tributions of signal and background events depend on the PDFs. The Tevatron experiments 
have agreed on a common procedure to evaluate the top quark mass uncertainty related to 
PDF modeling, which is described for example in [3pJ. Typically, the simulated events used 
to calibrate the measurements (cf. Section 110. 2p are based on a leading-order PDF set like 
CTEQ5L [22]. Systematic variations are however only provided for the PDF set CTEQ6M \27\ . 
Therefore, the top quark mass is recomputed with a calibration based on the central CTEQ6M 
PDF set, and the differences between that value and the ones obtained with the systematic 
variations of the CTEQ6M PDF are added in quadrature and assigned as a systematic un- 
certainty. Note that the difference between top quark masses evaluated with the calibrations 
based on the CTEQ5L and central CTEQ6M PDF sets is not included in the uncertainty. 
The difference between the results obtained with the CTEQ5L and MRST leading-order PDF 
sets is taken as another uncertainty. Finally, the effect from using MRST PDF sets based 
on different assumed a s values is determined. These three individual systematic uncertain- 
ties are summed in quadrature, the variation of CTEQ6M parameters yielding the dominant 
contribution. 

For the determination of the world-average top quark mass, the resulting error is taken 
as 100% correlated between individual measurements. The size of the uncertainty is given in 
Table H] for the DO measurement in the lepton+jets channel. The individual contributions are 



Since this systematic error is correlated between all measurements, a common procedure 
for its evaluation like the one described above is important. Improvements of the above scheme 
are however still desirable and possible: 

• The use of a leading-order matrix element together with a PDF set intended for processes 
in next-to- leading order is not consistent. The calibration of future measurements of the 
top quark mass should be based on next-to-leading-order Monte Carlo simulation using 
CTEQ6M (or updated PDF sets for which systematic variations are available), which 
would naturally resolve this inconsistency. 

• Different top quark mass measurements may be more or less sensitive to variations of 
individual parameters describing the PDF set. For example, depending on kinematic 
event selection cuts, the relative importance of the gluon PDF may vary even considering 
only Tevatron analyses; this will become a more important issue when measurements 
at the LHC are included as well, where the gg — > ti process dominates. Consequently, 
the quadratic sum resulting from the variations of all PDF parameters should not be 



CTEQ6M variations: 
difference MRST-CTEQ5L: 
variation of a s : 



+0.12 -0.38 GeV, 

±0.09 GeV, 
+0.06 -0.03 GeV. 
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taken as 100% correlated between measurements, but top quark mass shifts should be 
quoted for each individual PDF parameter variation. This will then allow for a more 
refined computation of the uncertainty on the world average, potentially slightly reducing 
the overall systematic error due to PDF uncertainties. (Note that no extra systematic 
uncertainties will have to be evaluated, only a more refined report of individual variations 
is needed.) 

• The comparison of top quark masses obtained with leading-order CTEQ and MRST 
PDF sets aims to quantify potential uncertainties arising from different PDF fitting 
procedures. However, these PDF sets are not based on exactly the same inputs, lead- 
ing to additional differences that should already be covered by the variation of CTEQ 
parameters. Since the systematic error arising from the CTEQ/MRST comparison is 
small, this is currently not an important issue. 
Currently, the systematic top quark mass error related to PDF uncertainties does not 
dominate the world average, cf. Table HI Because it is correlated between individual measure- 
ments, it may become important in the future, but only if no further improvements on PDF 
uncertainties are assumed. 

11.1.2 Initial- and Final- State Radiation 

Radiation off the incoming and outgoing partons may affect the top quark mass measurement. 
Such radiation changes the kinematics of the ti decay products in the final state; for example, 
the transverse momentum of the ti system is not zero when initial-state radiation (isr) takes 
place. Final state radiation (fsr) changes the momenta of the ti decay products and thus 
affects the shapes of templates or the signal probability assigned to an event. Also, isr or fsr 
may lead to jets which can be misidentified as ti decay products. 

Initial- and final-state radiation are governed by the same equations and are modeled in 
the shower evolution in the Monte Carlo simulation. (Interference between isr and fsr cannot 
be taken into account in this simulation, only when next-to- leading matrix elements are used.) 
The CDF experiment has shown how the details of the radiation process can be studied with 
Drell-Yan events [38| 157] . In Drell-Yan events only isr is present (photon radiation off charged 
leptons is assumed to be well-modeled, so only QCD radiation is considered here); it can 
lead to a non-zero transverse momentum px of the dilepton system. The mean dilepton px 
is shown to have a linear dependence on the logarithm of the dilepton invariant mass, which 
is reproduced by pythia simulation with standard parameter settings, cf. Figure H2J The 
study of Drell-Yan events also motivates two alternative pythia parameter sets leading to 
more or less isr and fsr activity, which are used to evaluate the systematic uncertainty on the 
top quark mass. The parameters changed are Aqcd and the scale factor k to the transverse 
momentum scale for isr showering; settings of Aqcd = 292 MeV and k = 0.5 are used for the 
sample with increased isr activity, while Aqcd = 73 MeV and k = 2.0 are taken for the sample 
with less isr. The resulting mean dilepton pt values are also indicated in Figure |42j 

The approach followed by DO in [39] is instead to vary directly the fraction of events with 
significant radiation. The calibration of the measurement is repeated based on events where a 
ti pair is produced together with an additional parton. Since the cross-section for ti production 
is 30% larger in next-to-leading order than in leading order, 30% of the observed difference 
between the top quark masses measured with the default and this alternative calibration is 
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Figure 42: JTie average pr of the dilepton system in Drell-Yan events, which is a measure of the 
level of isr activity, as a function of the dilepton invariant mass squared (note the logarithmic 
horizontal scale) l38\ [57| /. The points with error bars indicate CDF measurements, while the 
solid and dashed lines show predictions by the pythia generator (standard parameter settings 
and variations for systematic error evaluation, respectively). 
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assigned as systematic uncertainty. 

In the £+jets channel, the CDF experiment quotes a systematic uncertainty of 0.5 GeV 
for their template measurement [HZ], while the Matrix Element measurement is more sensitive 
to the modeling and finds a 1.0 GeV uncertainty [BB] (adding isr and fsr uncertainties in 
quadrature). For the DO Matrix Element measurement in the £+jets channel, a systematic 
error of 0.5 GeV has been evaluated [39] (see Table SJ) using a different technique as described 
above. Similar uncertainties have been obtained in the dilepton [?5 | l?6 ] 188] (0.4 GeV, 0.6 GeV, 
and 0.7 GeV, respectively) and all-jets channels [BH] (0.7 GeV), where the values quoted are 
from the measurements using 1 fb _1 of data. 

In the future, the uncertainty in isr and fsr modeling may become a dominant systematic 
error since it is correlated between all measurements. For consistency, it would therefore be 
highly desirable to arrive at an agreement between experiments on how to evaluate it, as is 
the case for the PDF error, see Section 111. 1.11 In addition, more precise studies of isr and fsr 
should be carried out. The analysis of Drell-Yan events that the CDF experiment has published 
in [38| 157] can be repeated with much more data and thus extended to larger invariant dilepton 
masses, so that the extrapolation to ti events becomes smaller. This may allow to decrease 
the width of the error band shown in Figure l4*2l In addition, an examination of LEP/SLC 
results on hadronic Z decays may yield independent experimental information on fsr. Finally, 
it may soon become worthwhile to optimize the measurement techniques not only in view of 
the statistical error, but to also keep systematic effects in mind. An idea developed for the 
LHC is to consider events in which the top and antitop quarks have large px, which means that 
their decay products are found in two separate event hemispheres; the mass of the top quark 
with the hadronic W decay could then be reconstructed from the individual hadrons, making 
jet reconstruction superfluous and rendering the measurement mostly insensitive to final-state 



94 



11 SYSTEMATIC UNCERTAINTIES 



radiation [8H]. But already the simultaneous JES fit in the £+jets Matrix Element analyses 
at the Tevatron has proven to reduce the sensitivity of the result to radiation modeling [9"U] . 
and an additional integration over the tt transverse momentum, as used in [73], may further 
reduce this uncertainty. 

11.1.3 Fragmentation 

Related to final-state radiation are the formation of jets in the final state and the spectra of 
hadrons within the jets. The fragmentation and hadronization of 6-quark jets is particularly 
important: in dilepton events, only &-quark jets are expected (except for jets from isr or 
fsr), and in £+jets and all-jets events, in situ calibration of the jet energy scale can largely 
absorb the dependence on the modeling of light (u, d, s, c) quark jets. Simulations based 
on different fragmentation and hadronization models may predict different average energy 
fractions contained within the reconstructed jet; this leads to an uncertainty on the relation 
between jet and parton energies and thus on the measured top quark mass. In addition, 
6-quark fragmentation also affects the efficiency of 6-jet identification: for a given 6-quark 
energy, an increase of the average energy fraction (x) carried by the bottom hadron will lead 
to a higher probability to detect a well-separated secondary decay vertex even for low-energy 
b quarks and will thus affect the kinematic distribution of the selected events. 

Data from LEP and SLC on Z — > bb decays constrain b fragmentation models and yield for 
example a precise determination of the mean energy fraction (xt,) of the weakly-decaying bot- 
tom hadron in Z decays [T]. To extrapolate to tt decays, different fragmentation models that 
are consistent with Z data are used to simulate tt events and the corresponding distribution 
in top quark decays (References [91] define (xb) as the bottom hadron energy divided by the 
maximum possible 6-quark energy). To evaluate the uncertainty on the top quark mass, the 
calibration of the measurement is determined using these different models, and the observed 
differences in the top quark mass are assigned as systematic error. 

Uncertainties in the decay of bottom (and charm) hadrons can also play a role. In partic- 
ular, jets containing a semileptonic decay of a heavy hadron will on average be reconstructed 
with a smaller energy due to the escaping neutrino. Thus the top quark mass depends on the 
rate and modeling of semileptonic heavy hadron decays. To assess uncertainties in the decay 
model, the semileptonic branching fractions of heavy hadrons in 6-quark jets are varied within 
the bounds from measurements in Z decays [T]. 

Like the other systematic errors related to physics modeling, the resulting uncertainties 
are correlated between measurements. The semileptonic branching fractions are known so 
precisely that the associated systematic error is negligible; however, the 6-quark fragmentation 
uncertainty may become a dominating uncertainty in the future, see Table HJ As a first step, 
a common scheme for evaluating this uncertainty should be agreed on. This could be the 
definition of a set of fragmentation models and parameters (like the ones studied in |39j ) 
on which the evaluation of uncertainties is based for each measurement. Such a common 
definition would not only lead to a consistent evaluation of uncertainties, but also allow for a 
correct determination of systematic correlations between individual measurements. As a next 
step, measurement techniques with reduced sensitivity to the details of 6-quark fragmentation 
could be developed. The technique based on high-p T top quarks mentioned in Section 111.1.21 
which does not rely on conventional jet finding, may serve as an example, but will need to be 
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refined to optimize the overall uncertainty. 

11.1.4 Top Quark Mass Definition 

A top quark mass measurement based solely on invariant mass reconstruction from the mo- 
menta of the decay products corresponds to a measurement of the pole mass. All results 
available today are based mainly on properties of the ti decay products which in turn depend 
on the top quark (pole) mass; the current measurements can therefore be regarded as pole 
mass measurements to a good approximation. However, calculations of effects involving the 
top quark mass like the ones described in Sections 112.2.11 and 112.2.21 are typically not per- 
formed using the pole mass. The transformation into the MS scheme is known to three loops 
and is e.g. given in [3]; such a transformation introduces an uncertainty when interpreting the 
top quark mass. 

Measurements in the dilepton channel necessarily include other information as well since 
the kinematics of the ti system is underconstrained; also measurements in the £+jets and all- 
jets channels make use of additional information from the ti production process to a varying 
degree in order to reduce the statistical measurement uncertainty. It still remains to be studied 
to what extent this fact leads to an uncertainty in the interpretation of the measurement 
results. 

In addition to the above, the pole mass itself is not defined to arbitrary accuracy for a 
colored particle like the top quark, and there is necessarily some additional color flow involved 
in the creation of the colorless final state measured in the detector. It has been shown in [18] 
that this introduces an intrinsic uncertainty of the order of Aqcd on the pole mass. The exact 
size of the uncertainty depends on the details of the measurement, and detailed studies of this 
effect are only starting. 

11.1.5 Color-Reconnection Effects 

Apart from the intrinsic uncertainty on the pole mass of a colored particle, color-reconnection 
effects between the final-state products may lead to additional effects. Corresponding studies 
for the measurement of the W boson mass at LEP2 are described in [2]. The effect in WW 
production at LEP2 is small compared to uncertainties on the top quark mass (a 35 MeV 
systematic error is quoted in the all-jets WW final state) in the present and near future. The 
all-jets WW final state may be considered similar to £+jets ti events; however, the kinematics 
are different; the colored beam remnants may well play an additional role in ti events at 
hadron colliders, and it is not clear how in situ calibration of the jet energy scale is affected. 
It is expected that the results of first studies of the size of color-reconnection effects will be 
published soon [92J. 

11.1.6 Bose-Einstein Correlations 

The LEP experiments have determined the effects from Bose-Einstein correlations between 
particles in WW events [2j. The resulting uncertainty on the top quark mass has not yet 
been studied, but it can be expected to be of the same order as that assigned to the W mass 
measured in the all-jets WW final state (7 MeV [2]). Such an uncertainty would be negligible 
for the top quark mass. 
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11.1.7 Underlying Event 

In principle, particles produced from the remnants of the colliding hadrons may contribute 
energy to the jets reconstructed in the detector. It is therefore necessary to measure the 
average contribution and subtract it from the jet energies. This is done as part of the jet 
energy calibration. The resulting uncertainty is small, as shown in Figure HH and included in 
the jet energy scale uncertainty (even though it is in principle correlated between experiments). 

11.1.8 Background Modeling 

All physics uncertainties (except for the top quark mass definition) discussed in the previous 
sections affect the modeling of both signal and background events. In this section, additional 
uncertainties that are specific to the background model are discussed, separately for £±jets, 
dilepton, and all-jets events. 

Lepton+Jets Channel: The two main backgrounds in the £+jets channel are leptonically 
decaying W bosons produced in association with jets (W+jets events) and multijet events 
containing a wrongly identified isolated lepton (QCD events). Both CDF (see e.g. [SSJ 155] ) 
and DO ([39]) find that the main uncertainty related to the modeling of W+jets background 
comes from a variation of the factorization scale \j? F used in the generation of these events. 
An additional contribution comes from the variation of the flavor composition of the jets in 
iy+jets events [381166]. 

Both CDF and DO base the estimate of QCD background on data. It is not straightforward 
to define a sample for this estimation that is kinematically unbiased and does not contain a 
sizeable ti component. Therefore, the QCD background estimate is replaced with iy+jets 
events, and the resulting difference is conservatively quoted as systematic error. 

The CDF values quoted for the uncertainty from modeling of iy+jets and QCD events 
are 0.2 GeV [55] and 0.5 GeV [38], but cannot be compared directly to the DO value of 
0.4 ©0.3 GeV = 0.5 GeV [39] (see Table HJ) since CDF and DO consider different factorization 
scales, and DO does not vary the heavy flavor content in W^+jets events. 

Dilepton Channel: The main backgrounds in the dilepton channel come from diboson 
(WW, WZ) or Drell-Yan production (Z/j* — > e + e~, t + t~) in association with jets, 

and from events with a mis- identified electron (e.g., W(—t \xv) + 3jets with a jet faking an 
electron). To estimate the systematic uncertainty, the number of expected events from each 
source is varied independently within its error, and the resulting top quark mass shifts are 
added in quadrature [721 ESI ESJ. The systematic uncertainties assigned in individual top 
quark mass measurements vary between ±0.3 GeV and ±1.0 GeV (and even ^"9 GeV). In 
addition, systematic variations of the background shapes yield another uncertainty of up to 
±1.0 GeV. Given the fact that the background contribution to the dilepton event samples is 
small while the statistical uncertainty is still large, it seems that some of these preliminary 
estimates are very conservative and that a much smaller uncertainty will be quoted in the 
future. 
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All- Jets Channel: In this channel, the dominant background is from QCD multijet produc- 
tion. In the CDF analysis [5H] the background is estimated from the data using a parametriza- 
tion of the 6-tagging efficiency. The overall normalization of this background estimate and 
the residual signal contribution are varied and each contribute a systematic error of 0.5 GeV 
on the top quark mass. The accuracy of the background estimator is checked with signal 
depleted event samples, and no additional shape uncertainty is assigned. 

The error on the world-average top quark mass that is due to background-specific un- 
certainties only amounts to 0.3 GeV [9]. It will be possible to select £+jets and dilepton tt 
samples for top quark mass measurements with much smaller backgrounds at the LHC [891 G3] 
because of the larger tt cross section and better detector resolution, so that it can be expected 
that the uncertainty from background modeling will further diminish in the future. 

11.2 Modeling of the Detector Response 

For most individual measurements of the top quark mass, the dominant error is due to un- 
certainties in the detector response, most notably the jet energy measurement (see Table H]). 
Even though these errors are only correlated between measurements of one experiment, and 
despite the possibility of in situ JES calibration, the absolute jet energy scale uncertainty 
still dominates the world average. 

Because simulated events are used to calibrate the mass measurements, it is not the un- 
certainty on the absolute detector response that matters, but the uncertainty on the relative 
difference between the data and the simulation. Contributions can in principle arise from 
any aspect of the data related with the event selection and/or top quark mass reconstruction, 
ranging from uncertainties in the modeling of an energy dependence of event quality cuts, re- 
construction or selection efficiencies, to the calibration of the reconstruction of the final-state 
leptons and jets. 

11.2.1 Jet and Charged Lepton Energy Scales 

In practice, by far the largest uncertainty arises from the uncertainty on the ratio of absolute jet 
energy scales in the data and simulation. Therefore, in situ calibration techniques are applied 
in the £+jets channel as described in Sections [HI and This means that the uncertainty 
on the absolute jet energy scales with the statistical error. The uncertainty obtained by DO 
with 0.4 fb" 1 is t 3 3 2 7 GeV [39]; the CDF experiment quotes 2.5 GeV using 0.68 fb" 1 [67]. 

Without this technique, external measurements of the jet energy scale as described in 
Section 15.21 have to be used, and the uncertainty on the ratio between data and simulation 
propagated to the final result. The resulting systematic error is currently between 3 and 
5 GeV in the ^+jets [57], dilepton [721 EH E3 EH [88], and all-jets channels [581 |69] and is 
correlated between all measurements at the same experiment. 

Even with in situ calibration, only one overall jet energy scale factor is determined. Any 
discrepancy between data and simulation other than such a global scale difference may lead 
to an additional uncertainty on the top quark mass, which is however much smaller than that 
arising from the overall absolute calibration. Uncertainties on residual \n\ and px dependencies 
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of the jet energy scale are taken from the external calibration and are typically estimated to 
be below 0.5 GeV, see for example References [39, 66J and Table HI 

The second-largest detector modeling uncertainty in the £+jets channel is the uncertainty 
on the double ratio between the jet energy scales for 6-quark and light jets in the data and 
simulation. This error is due to differences between the calorimeter response to electromag- 
netic and hadronic showers and the uncertainty on the electromagnetic/hadronic energy ratio 
in 6-quark jets. The CDF collaboration has evaluated it to be ±0.6 GeV [38, 66J, and the DO 
experiment has obtained GeV [35] . 

In comparison with the energy scale for jets, the absolute energies of charged leptons are 
calibrated precisely using leptonic Z decays. The uncertainty has been found to be negligible 
at DO [90]; the CDF experiment quotes an uncertainty of 0.1 GeV in the dilepton measure- 
ments [751 176] . 

In the future, information from the overall jet energy scale calibration described in Sec- 
tion 15. 2. 14 which is not used in measurements with in situ calibration, can be introduced as 
an additional constraint to improve the world average. The uncertainty related to the b- to 
light-quark jet energy scale ratio may become a limiting systematic error in the mid-term 
future. Even though it is related to detector response, it is correlated between all measure- 
ments. The measurement of Z — > bb events has proven very difficult at the Tevatron, and 
event samples with a b jet balanced by a photon or Z decay are limited in statistics. Ideas for 
the an in situ calibration of this energy scale ratio would therefore be very helpful; otherwise 
radical techniques like a top quark mass measurement based on secondary vertex decay length 
information [68J or leptonic J ftp decays in top quark events [94J can be employed using the 
large-statistics samples at the LHC. 

11.2.2 Event Selection 

Uncertainties in the event selection efficiency, notably energy- dependent effects, can lead to 
systematic effects on the top quark mass. For example, the trigger efficiency is measured in the 
data using reference triggers, and the uncertainty on the dependence on charged lepton and 
jet energies is propagated to the top quark mass result. Similarly, the 6-tagging efficiencies are 
determined from the data and varied within their uncertainties. Recent measurements in the 
£+jets channel quote systematic uncertainties of not more than a few hundred MeV [38| 139]. 
Since the event selection efficiencies are calibrated using the data, it can be expected that the 
associated uncertainty will further diminish in the future. 

11.2.3 Multiple Interactions 

Bunch crossings with more than one hard interaction may lead to events where the ti decay 
products cannot be easily identified, or with additional energy contributions to the jets from 
the ti final state. As long as such events are modeled accurately, these effects can be taken into 
account in the calibration. However, uncertainties on the instantaneous luminosity and the 
properties of the additional hard interaction lead to a systematic uncertainty on the top quark 
mass. Recent CDF measurements quote a 0.05 GeV [66] to 0.2 GeV [751 ESI EE] uncertainty. 

Overlay of calorimeter energy from subsequent bunch crossings was an issue at DO Run I 
but is no longer significant due to a change in readout electronics |39j . 
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11.3 Uncertainties Related to the Measurement Method 

Since the calibration of a measurement method is based on simulated events, limited Monte 
Carlo statistics gives rise to a systematic uncertainty on the top quark mass. There may be 
other systematic errors inherent to a specific method. An example is the DO Matrix Element 
measurement in the £+jets channel where the calibration depends slightly on the ti fraction in 
the selected event sample; the uncertainty on this fraction then leads to a systematic error on 
the top quark mass. Uncertainties of this type are normally uncorrelated between individual 
measurements, and are not dominant. 

11.4 Summary 



The precision of the world-average top quark mass is already limited by systematic errors [S] . 
Currently, the single largest uncertainty is due to the absolute jet energy scale. With in situ 
calibration using the hadronic W mass, this error will be reduced with larger data sets. Until 
the startup of the LHC, physics modeling uncertainties (which are correlated between all 
measurements) will become dominant. In particular, work on the consistent evaluation (and 
reduction) of the uncertainties due to isr/fsr modeling, 6-quark fragmentation, and the 6 /light 
jet energy scale ratio is very desirable in the near future. 
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12 Results, their Interpretation, and Future Prospects 

This section gives an overview of the most recent measurements of the top quark 
mass and how they contribute to the world average. The current knowledge of the top 
quark mass is then set into perspective by discussing its implications for the Standard 
Model of particle physics, notably for consistency tests and indirect constraints on 
the mass of the Higgs boson. Finally, the prospects for future improvements of top 
quark mass measurements are outlined. 



The Tevatron experiments have employed various methods to measure the top quark mass, 
as described in Sections ISTfTOl The most relevant individual measurements are combined 
by the Tevatron Electroweak Working Group, taking correlations into account as already 
outlined in Section [TTJ In Section 112.11 the individual measurement results are summarized, 
the combination procedure is described, and its current results are presented. Section 112.21 
gives an interpretation of these results in the framework of the Standard Model and also 
discusses implications for the Minimal Supersymmetric Standard Model (MSSM). Finally, an 
overview of improvements to be expected with the startup of the LHC and a future linear 
e + e~ collider (ILC) is given in Section [12. 31 



12.1 Measurement Results and Their Combination 

A large number of measurements of the top quark mass has been performed to date at the 
Tevatron, using data in the £+jets, dilepton, and all-jets decay channels and applying a wide 
variety of measurement techniques [8] . Table [5] summarizes the results. 

No significant deviations are apparent between the top quark masses measured in individ- 
ual decay channels, with different measurement techniques, by the two experiments, or at the 
two Tevatron center-of-mass energies of 1.8 TeV (Run I) or 1.96 TeV (Run II). However, many 
of the individual results are systematically and also statistically correlated. To quantify these 
statements, a consistent combination of results is performed by the Tevatron Electroweak 
Working Group [9] based on the best linear unbiased estimator (BLUE) [101], I102j . The 
procedure takes systematic correlations into account by treating individual systematic uncer- 
tainties as uncorrelated or 100% correlated between measurements as discussed in Section [TTJ 
More detailed studies are in general needed to evaluate the statistical correlation between 
measurements using the same dataset and decay channel. As an example, the statistical 
correlation between the top quark mass values determined at DO in the topological Matrix 
Element analysis and the Ideogram measurement (which uses b tagging) has been found to be 
only +40% [B21 ED] • Since Tevatron Run II analyses are still evolving, such a study is not yet 
available in many cases. Therefore, a combination of Run I values and only the most precise 
Run II measurements in each channel is performed. The correlations between these measure- 
ments are close to zero unless a correlation arises via common jet energy scale uncertainties; 
in that case correlation coefficients are typically of the order of 30%, the largest being 56% 
between the CDF Run I ^+jets and Run II all-jets measurements. 
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Table 5: Overview of top quark mass measurements. Analyses are grouped according to the tt 
decay channel listed in the leftmost column. The symbol "Ifr ± jets" denotes a selection based 
on IpT and jets only, yielding a sample enriched in events with a W — )■ tv decay. The list is 
further ordered according to the analysis technique (T: template based; ME: Matrix Element; 
DL: Dynamical Likelihood; ID: Ideogram), given in the second column together with the section 
describing it. All recent CDF and DO analyses of Run II data and those Run I measurements 
that are included in the world average J3/ are listed. The experiment and integrated luminosity 
are given, and the top quark mass results are quoted with their statistical and systematic 
uncertainties. For measurements using in situ calibration, the uncertainty from the overall jet 
energy scale is included in the first quoted error as it will scale with statistics in future updates. 
The rightmost column lists the weight given to measurements in the world average value. For 
the measurements marked with a T sign, an earlier result is used in the combination, while the 
most recent value is given in the table. 
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The combination yields average top quark masses in the individual channels of 

mt(M-jets) = 171.3 ± 2.2 GeV , 
m t (dilepton) = 167.0 ± 4.3 GeV , and (56) 
m t (all-jets) = 173.4 ± 4.3 GeV , 

where the uncertainties include both statistical and systematic errors. The correlations C and 
resulting \ 2 consistency values (for one degree of freedom) have been determined as 

C(£+jets, dilepton) = +37%, x 2 (^+jets, dilepton) = 1.2, 

C(£+jets, all-jets) = +29% , x 2 (^+jets, all-jets) = 0.24 , and (57) 
C(dilepton, all-jets) = +46% , x 2 (dilepton, all-jets) = 2.1 . 

Since the values for all three channels are consistent with each other, one overall combined 
top quark mass value is computed. It is found to be 

m t = 171.4 + 2.1 GeV. (58) 

The weights with which the individual top quark mass measurements contribute to this average 
are indicated in the last column of Table [5j Individual measurements may be assigned a 
negative weight in case of large correlations; as long as the weight is non-zero, the measurement 
still improves the average. This effect is explained very clearly and intuitively in jlOlj . The x 2 
for the average is 10.6 for 10 degrees of freedom, and the largest single pull of any measurement 
that enters the combination is 1.8, indicating good consistency of all 11 measurements. 

The £+jets, dilepton, and all-jets channels contribute with weights of 86.4%, 3.7%, and 
10.0% to the world average, respectively. These weights are indicative of the experimental 
situation at the Tevatron, with limited statistics in the dilepton and large backgrounds in the 
all-jets channel. With increasing data sets at Tevatron Run II, the relative importance of the 
dilepton channel may increase. Precise measurements of the top quark mass in the all-jets 
channel have only become possible after detailed studies of the background and its evaluation 
from the data. When in situ calibration techniques are applied in this channel, too, its weight 
may further increase. 

12.2 Interpretation of the Top Quark Mass Measurement 

As mentioned in Section 112.11 the top quark masses obtained in the £+jets, dilepton, and 
all-jets channels are consistent with each other. Moreover, the cross section for production of 
tt events at the Tevatron is consistent with the (Standard Model) expectation computed for 
the combined top quark mass value given in Equation (15B1 . No average value of all Tevatron 
Run II measurements of the tt cross section exists yet; however, the CDF experiment has 
performed a combination of CDF measurements [103J. The DO measurements can be found 
in [HJ 11061 E] • Figure H3] shows the combined CDF result and the recent DO measurement 
from Reference [H] together with the dependencies of the tt cross section measurements on 
the value of the top quark mass. Also shown are calculations of the tt cross section in next- 
to-leading order [13J as a function of the top quark mass. The measured cross sections agree 
well with the Standard Model prediction when assuming the world-average top quark mass 
value. 
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Figure 43: The average value of the ti production cross section in pp collisions at y/s = 
1.96 TeV as measured by the CDF experiment I103f is shown in (a). The vertical error bar 
indicates the ti cross section and its uncertainty evaluated at the CDF average value of the 
top quark mass. The dependence of the cross section measurement on the assumed top quark 
mass value is shown by the slope of the other error bar (its projection onto the horizontal 
axis corresponds to the uncertainty on the top quark mass using CDF measurements only). 
Also shown are NLO calculations of the Standard Model ti cross section, including threshold 
corrections from soft gluons \10J\ [105]. The uncertainty on these predictions is shown, too; 
it is dominated by the uncertainty on the gluon PDF. Consequently, no PDF uncertainty is 
included in the experimental result. Similarly, the result of a recent DO measurement fj^j 
is shown in (b) by the red lines, with the theoretical prediction from \10J$ overlaid. The 
combined information from the DO ti cross section measurement and the world-average value 
of the top quark mass from is indicated by the crossed error bars. Note the different scales on 
the horizontal axes of the two plots. 
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Also other top quark measurements like the relative cross sections for the various decay 
channels or differential cross sections agree well with Standard Model predictions Since 
there is no sign of effects beyond the Standard Model, it is appropriate to use the world-average 
top quark mass value in a consistency check of the Standard Model and, if consistency can be 
established, to extract information on Standard Model parameters. The interpretation within 
the Standard Model (SM) is discussed in Section 112.2.11 Analogously, the measurements can 
of course also be used to constrain the parameters of any other model that describes them. 
Particular attention has been devoted to supersymmetric models. The interpretation within 
the Minimal Supersymmetric Standard Model (MSSM) and the differences between MSSM 
and SM predictions are described in Section 112.2.21 While it is currently not yet possible to 
distinguish between the SM and MSSM based on indirect precision measurements, further 
improvements of these measurements may make this possible and thus provide information 
e.g. to help interpret potential future signals of new physics. 

12.2.1 Interpretation within the Standard Model 

An overall fit of Standard Model parameters is performed by the LEP Electroweak Working 
Group [2]. The general conclusion is that the Standard Model describes the measurements 
well and that there is no significant evidence for phenomena beyond the Standard Model. 

Using the Standard Model relations, it is possible to infer information even on those 
parameters that have not (yet) been directly measured. Of particular interest is the constraint 
on the mass of the Higgs boson. As outlined in Section [5J within the Standard Model the 
mass of the W boson depends quadratically on the top quark mass and logarithmically on the 
mass of the Higgs boson. This dependence is visualized in Figure HU Figure l4*4T a) shows the 
agreement between direct measurements of the W and top quark masses from LEP2 and the 
Tevatron, shown in blue, and indirect constraints that are valid within the Standard Model (red 
contour). Also shown is the Standard Model relation between mw and m t for various assumed 
values of the Higgs mass; the green band covers the range 114 GeV < m# < 1000 GeV. The 
lower value of ran = 114 GeV corresponds to the direct exclusion limit from LEP searches. 

Figure l4*4T b) shows how the top quark mass measurement contributes to the indirect con- 
straint on the Higgs mass. In the mu-mt plane, the blue contour depicts the information on 
these two parameters obtained from the Standard Model fit, where the direct m t measure- 
ment is not used as input. The projection of the blue contour onto the vertical axis thus 
corresponds to the indirect constraint on the top quark mass within the Standard Model of 
m t = 178 + ig GeV (note that a projection of the 68% C.L. contour from two dimensions to 
one does not correspond to one-dimensional 68% confidence limits). The green band cor- 
responds to the direct top quark mass measurement, which is in good agreement with the 
indirect prediction. The band visualizes how this information, given the W mass and other 
measurements, excludes large values of the Higgs mass within the Standard Model. Further 
improvements of the precision of the top quark mass measurement will improve the indirect 
constraint on the Higgs mass, but are unlikely to push this constraint into the region of mass 
values that has already been excluded at LEP, shown in yellow. 

Similarly, the information on the Higgs mass obtained from the W mass measurement is 
shown in Figure l4"4T c). The direct measurements of the W mass, shown as the green band, 
are in agreement with the blue contour showing the indirect constraints. The contour from all 
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Figure 44: A comparison of direct measurements and indirect constraints within the Standard 
Model on the top and W masses, as well as indirect constraints on the Standard Model Higgs 
mass JMj. (a) Direct measurements (blue dashed contour) of and indirect constraints (red solid 
contour) on m t and mw in the m t -mw plane, together with the Standard Model prediction of 
the relation between m t and mw for various assumed Higgs masses, (b) Direct measurement of 
m t (green band) and indirect constraints on m t andmu, excluding the direct m t measurement 
(blue contour), (c) Direct measurement of mw (green band) and indirect constraints on mw 
and mH, excluding the direct mw measurement (blue contour), (d) Indirect constraint on m^: 
Ax 2 with respect to the best fit as a function of assumed Standard Model Higgs boson mass. 
The light blue band indicates the uncertainty from higher-order corrections not included in the 
calculation. Also shown are fits including the NuTeV mw result (pink dotted curve) or based 
on a value of a(mz) obtained with additional theoretical input (red dashed curve). In (b), (c), 
and (d) the yellow area shows the region of Standard Model Higgs masses excluded by direct 
searches. 
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measurements but the W mass does not extend to high m# values in this plot since the top 
quark mass information is already included. With a significant improvement of the W mass 
uncertainty the region of Standard Model self-consistency might be significantly reduced even 
before direct Higgs searches become sensitive beyond the current limit. 

All indirect information on the Higgs boson mass is summarized in Figure BH^ d), where 
the black curve shows the A% 2 within the Standard Model as a function of assumed Higgs 
mass relative to the minimum value. The light blue band around it shows an estimate of the 
uncertainty from higher-order corrections that were not included in the calculation. Taking 
the information from this curve and including these theoretical uncertainties, the one-sided 
95% CL. upper limit on the Standard Model Higgs mass is 166 GeV. When the lower limit 
from direct searches is included, the upper limit shifts to 199 GeV. 

In summary, the Standard Model yields a good description of experimental data; for 
example the top quark mass measurement is in good agreement with indirect constraints valid 
within the Standard Model. The top quark mass measurement is an important ingredient 
to fits in which indirect information on the mass of the Standard Model Higgs boson can 
be obtained. With the precision of the top quark mass value achieved with the techniques 
described in this report it is possible to place stringent upper bounds on the mass of the Higgs 
boson within the Standard Model. 

12.2.2 Interpretation within the Minimal Supersymmetric Standard Model 

Even though there is no compelling experimental evidence of physics effects beyond the Stan- 
dard Model from collider experiments, it is instructive to interpret precision electroweak mea- 
surements also in extended models. As the top quark contributes via loop diagrams to the 
predictions for electroweak parameters, it is mandatory to know these contributions (and 
therefore the top quark mass) precisely to pin down any potential effects from additional, yet 
unknown, particles. In particular, a detailed study has been performed that compares the 
predictions of the Minimum Supersymmetric Standard Model (MSSM) [107] with those of the 
Standard Model (SM) in view of the precision measurements of the top quark and W boson 
masses [108]. This study is summarized here. 

After calculating contributions from loop diagrams involving supersymmetric particles, it 
is possible to compare the predictions of the SM and MSSM with each other and with the 
experimental data, as shown in Figure H5l The two model predictions lie within bands in the 
m t -mw plane, with only a narrow overlap region. Apart from the fact that the red and blue 
regions correspond to a variation of the Standard Model Higgs mass between 114 GeV and 
only 400 GeV, the information is equivalent to the predictions shown in Figure l44T a) where 
the upper value of the Higgs mass is set to 1000 GeV (and the axes are scaled differently). 
The green and blue areas indicate the allowed region for the MSSM (in the region above the 
green area, at least one of the mass ratios m^/m^ and m~ h Jm~ h ^ is larger than 2.5, where 
in both cases the lighter mass state is denoted by the index 1). The MSSM allowed region 
was obtained by varying supersymmetry parameters independently from each other. In the 
Standard Model, the blue area which is allowed in both models corresponds to the case of a 
light Higgs boson within the range allowed in the MSSM, while in the MSSM, it corresponds to 
the case where all superparticles are so heavy that the theory becomes effectively equivalent 
to the Standard Model. The current direct measurements of the top quark and W boson 
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Figure 45: Comparison of predictions of the SM and MSSM with current and potential future 
direct measurements of m t and mw ■ The figure is from 1108^ and has been prepared including 
calculations described in U09f . The area allowed in the SM corresponds to Higgs masses within 
the range 114 GeV < m# < 400 GeV, while the MSSM region has been obtained in a parameter 
scan. The blue ellipse shows the current direct measurements, while the sizes of the black and 
red contours indicate potential future improvements of the uncertainties with data from the 
LHC and a future linear e + e~ collider, respectively (the central values for these contours are 
arbitrary). 



masses are shown by the blue ellipse. The black and red contours indicate rough estimates 
of the precision that can be achieved at the LHC and a future linear e + e~ collider (ILC), 
respectively (for the LHC, uncertainties on m t and mw of 1 GeV and 15 MeV have been 
taken, respectively, while values of 0.1 GeV and 7 MeV have been assumed for the ILC). The 
current central measurement values have been used to place these contours. 

Even though the central measurement values of m t and mw are not within the SM allowed 
region, based on the current data it is not possible to distinguish between the SM and MSSM. 
Nevertheless, it is evident that with increasing precision on m t and mw, a comparison with 
model predictions may provide important constraints on the model parameters — or provide 
a cross-check of models to help decide which one is correct should physics effects beyond the 
Standard Model be discovered in the future. 

Similar to the consistency check performed within the SM described in Section 112.2. 1| it is 
possible to evaluate for models beyond the Standard Model which sets of parameter values are 
most likely. Such an analysis has for example been carried out in [HOj for various constrained 
versions of the MSSM. The results tend to favor a relatively low MSSM scale, which would 
make the discovery of light supersymmetric particles possible at the LHC or even the Tevatron; 
an actual determination of parameter values of the MSSM can however not be performed with 
the current data. 
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12.3 Potential for Improved Top Quark Mass Measurements 

Given the interpretation of the top quark mass measurement outlined in Section [12.21 above, it 
is clear that a further improvement of the experimental precision is desirable. The uncertainty 
on the current world average discussed in Section 112.11 is already dominated by systematic 
uncertainties. In the future, the focus will therefore have to shift from an optimization of 
the statistical uncertainty to a detailed study of the systematics listed in Section [TTJ In this 
quest, larger event samples will still help in two ways: First, some of the systematic errors are 
expected to improve with increasing sample sizes, and second, large event samples will allow 
to select small subsamples which are less prone to systematics than the rest. 

In the following, the evolving situation at the Tevatron experiments is discussed first. 
Second, the prospects for measuring the top quark mass at the LHC are described. Finally, 
an outline of the potential for top quark mass measurements at a future linear e + e~ collider 
(ILC) is given. 

12.3.1 Future Top Quark Mass Measurements at the Tevatron 

With the Run Ila dataset not even fully analyzed, the Tevatron experiments have already 
surpassed the expectation that a combined top quark mass uncertainty of 2-3 GeV would be 
possible with the full Run II dataset. This shows how important the newly developed tech- 
niques (Matrix Element method, in situ calibration) are, and how delicate it is to extrapolate 
from the current situation into the future. An extrapolation from the current combined result 
is particularly difficult as the world average combines different types of measurements based 
on data sets corresponding to different integrated luminosities. A general picture can how- 
ever still be obtained from an analysis of how various measurements contribute to the current 
world average and of how the individual uncertainties of the most sensitive measurements will 
evolve. 

From Table it is obvious that the Run II measurements in the £+jets channel carry by 
far the largest weight (the two measurements that exploit full event reconstruction have a 
combined weight of 80.6%). The statistical sensitivities of the CDF [66J and DO [39] Matrix 
Element measurements in the £+jets channel are quite similar; the difference in uncertainties 
comes from the difference in the size of the data sets analyzed so far and also from the fact that 
the observed DO error is slightly larger than expected from simulations. Also, the systematic 
uncertainties are similar. Figure H6] shows how the uncertainties of the DO measurement will 
evolve with statistics if the analysis is unchanged. All errors are assumed to remain constant 
except the statistical and JES uncertainties which will be reduced with larger statistics. At 
the end of Run II, signal modeling and the 6-jet energy scale will give rise to the limiting 
uncertainties. To estimate the reach of a combination of CDF and DO analyses it is a good 
approximation to read off the diagram at the sum of integrated luminosities analyzed: The 
signal modeling uncertainties are fully correlated, and even the error due to the 6-jet en- 
ergy scale will be partly correlated as it is a combination of calorimeter response (e/h) and 
hadronization uncertainties. 

While it is not justified to make more precise extrapolations into the future, the above 
indicates the areas where improvements are most needed. On the one hand, it will become 
important to develop measurement strategies that are less sensitive to the 6-jet energy scale. 
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Figure 46: The composition of the uncertainty in the DO Matrix Element measurement in the 
i+jets channel P3^ . which is based on an integrated luminosity of OA fb _1 , and the expected 
evolution of the uncertainty with integrated luminosity when the measurement technique is kept 
unchanged. An uncertainty of 0.5 GeV to cover color reconnection effects, which have been 
shown in pTBj to give rise to an effect of the order o/Aq CDj has conservatively been added (grey 
band), while the errors from semileptonic b- or c-hadron decays and from trigger efficiencies 
are negligible and have been omitted. The color code for the individual error contributions 
is explained in the figure. The widths of the colored bands indicate the individual squared 
uncertainties; the vertical axis is therefore non-linear and accounts for their quadratic addition. 
The two vertical dashed lines indicate the range of expectations for the integrated luminosity 
delivered to each Tevatron experiment by the end of Run II. The inset shows the expected 
uncertainties if the method is applied unchanged to an 8 fb -1 dataset. The figure is only 
intended to visualize the relative importance of various sources of uncertainty; it cannot provide 
accurate predictions for future measurements of the top quark mass. 
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An extreme example is the measurement based on the secondary vertex decay length [5*%] . 
One could also envisage for example a measurement based on the Matrix Element method 
which minimizes the combined statistical and systematic error by artificially worsening the 
6-jet energy resolution used in the probability computation, or which is extended to determine 
a 6-jet energy response factor. On the other hand, it will be possible to repeat the studies of 
the Z pt spectrum described in Section [11.1.21 with much larger samples, extend the invariant 
mass range, and obtain a more precise extrapolation to tt events. Dilepton measurements 
will be less affected by uncertainties on final-state radiation than those in the other channels. 
Currently, the dilepton channel contributes less than 5% to the world average, which is mainly 
because no in situ calibration of the jet energy scale is used here. It would be very worthwhile 
to check if this is possible, as it would yield information on the 6-jet energy scale. Even without 
in situ calibration, the CDF measurement in the all-jets channel [HI] contributes to the world 
average with a weight of 10%. While events in this channel are well constrained kinematically, 
it remains to be seen if uncertainties due to hadronization and color reconnection can be kept 
under control. 

In summary, it appears feasible that the total uncertainty on the top quark mass will be 
reduced from the current value of 2.1 GeV to about 1.5 GeV by the end of Tevatron Run II. 
The actual precision reachable will depend more on further innovative ideas on the treatment 
of systematics than on the exact integrated luminosity delivered. 

12.3.2 Future Top Quark Mass Measurements at the LHC 

The Tevatron measurements of the top quark mass have only been possible with a very good 
understanding of the detectors. After the startup of the LHC, it will still take some time until 
the LHC experiments will be able to improve the combined Tevatron result significantly. On 
the other hand, the physics of tt production will be well-understood from the Tevatron, and 
large samples will be selected, which can be used for the commissioning and calibration of the 
detectors - most notably, to determine the absolute jet energy scale and the 6-jet identification 
efficiency 

Consequently, studies for the LHC experiments focus on two aspects: Detector commis- 
sioning with tt events [1111 1112] and innovative measurements with reduced top quark mass 
systematics that are not feasible with Tevatron statistics [89, [931 E3] - Surely, top quark mass 
measurements will become a field of precision studies of systematic effects, but it is difficult 
to say today exactly what precision will finally be reached at the LHC for the top quark pole 
mass, as that depends on techniques that are only being developed now and will be developed 
further when the data is being taken. 

An interesting proposal has been made in [113] to identify double-diffractive tt events at the 
LHC. In these events, the tt center-of-mass energy could be measured from the reconstructed 
protons, and a measurement of the tt cross section as a function of this center-of-mass energy 
would lead to a determination of the top quark mass. This technique would be complementary 
to measurements of the top quark pole mass from the properties of the decay products, and 
would rather be similar to a tt threshold scan at an e + e~ collider, which is outlined in the 
following section. If the cross section and integrated luminosity are large enough and the 
experimental challenges are solved, this measurement technique might be a way to overcome 
the principal theoretical limitations of top quark pole mass measurements at a hadron collider. 
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12.3.3 Future Top Quark Mass Measurements at the ILC 

When a top-antitop cross section measurement is compared to predictions, this comparison 
yields a measurement of the parameter "top quark mass" that was used in the calculation of 
the prediction. Because of its large width, the top quark does not hadronize, but the e + e~ — > ti 
production cross section still rises steeply at the energy corresponding to a IS resonance, and 
the top quark mass can be determined from the energy where this rise is observed. (At larger 
energies, the cross section varies much less rapidly with center-of-mass energy, leading to a 
larger uncertainty when interpreted in terms of the top quark mass.) Suitable definitions of 
the top quark mass for such calculations (so-called "threshold mass" definitions) are discussed 
in |2T]. 

At an e + e~ collider, ti events will have a striking experimental signature and can be 
selected with very low backgrounds. This means that the event selection can be kept simple 
enough so that it does not (or only very marginally, thus not introducing large uncertainties) 
depend on the exact properties of the top quark decay products — which would otherwise 
result in a measurement of the top quark pole mass, as discussed before in Section 12.11 In 
addition to the clean signature by which ti events can be selected, another prerequisite for 
this type of measurement is that the initial state is well-known. This is the case for an e + e~ 
collider (where only initial-state photon radiation has to be taken into account), but not for 
a hadron collider where the partons that initiate the hard interaction are only a part of the 
colliding hadrons. 

References [1144 1115] quote experimental uncertainties of 20-30 MeV on the IS top quark 
mass. An uncertainty of Ace s (mz) = 0.001 corresponds to an uncertainty of 70 MeV in the 
conversion of the threshold mass to the MS scheme [116] . leading to an overall uncertainty on 
the MS top quark mass of less than 100 MeV. 

In addition to the threshold scan, measurements of the top quark pole mass will of course 
also be possible at the ILC above the ti threshold; while the statistical and experimental sys- 
tematic uncertainties may be small, the interpretation of such measurements will be limited 
by an additional uncertainty of order Aqcd as discussed before. Thus complex analysis tech- 
niques like the Matrix Element method will no longer be needed for the measurement of the 
top quark mass for which they were originally developed (but this does not invalidate them 
as a means of minimizing the statistical uncertainty in any other measurement based on few 
events whose kinematic properties are well-understood). 

The possibility to determine the top quark mass via a threshold scan at the ILC corre- 
sponds to an order of magnitude improvement of the current uncertainty that will to current 
knowledge not be possible via explicit mass reconstruction from the decay products. The re- 
sulting constraints on the Standard Model or models beyond it will be very precise, as shown 
in Figure H5J and since the parametric uncertainties in the model predictions resulting from 
the top quark mass will be much smaller than today, stringent consistency tests of the models 
will be possible, as outlined for example in [117] . 
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13 Summary and Conclusions 

A measurement of the top quark mass is interesting per se because the top quark is by far the 
heaviest known elementary fermion. It is interesting also because the top quark mass is needed 
as an input parameter to calculations of electroweak precision variables - measurements of 
which can then be used to perform consistency tests of models or to obtain indirect information 
on as yet unmeasured parameters like the Higgs boson mass. 

To date, top quarks can only be produced at the Fermilab Tevatron collider. The physics 
of top-antitop pair production and the resulting event topologies have been outlined. The 
reconstruction of the events has been described, it has been discussed how an accurate cal- 
ibration of the detectors is indispensable for the measurement of the top quark mass, and 
the calibration procedures applied at the Tevatron experiments have been introduced. The 
determination of the absolute calorimeter energy scale is particularly challenging but also of 
particular importance for the measurement of the top quark mass. 

Since the discovery of the top quark at Tevatron Run I, our understanding of ti production 
at hadron colliders has matured, much more integrated luminosity has been accumulated, and 
sophisticated techniques to measure the top quark mass have been developed that have led 
to an unanticipatedly large reduction of the uncertainty. These experimental techniques have 
been described in detail. On the one hand, this allows the reader to understand the details of 
the measurements of the top quark mass. On the other hand, this report is also intended as a 
reference for the methods, which can be used in the future for other measurements of similar 
experimental nature. 

The current world-average value of the top quark mass is already dominated by systematic 
uncertainties. Their various sources have been discussed to identify the current limitations 
and to point out possible future improvements. 

The interpretation of our current knowledge of the top quark mass within the Standard 
Model of particle physics has been presented. Current results of precision electroweak mea- 
surements are in striking agreement with Standard Model predictions, and thus the top quark 
mass can serve as an input to calculations with which constraints on the mass of the Standard 
Model Higgs boson can be placed. A similar interpretation can also be performed within 
extended models, and an analysis in the Minimal Supersymmetric Standard Model has been 
shown. Within the Standard Model, a light Higgs boson is clearly favored, and the data is 
also consistent with predictions within the Minimal Supersymmetric Standard Model. 

Finally, the prospects for future measurements at the LHC and a linear e + e~ collider have 
been outlined. Many studies have been performed of how to measure the top quark mass based 
on the reconstruction of the final state in ti events at the LHC. At the LHC much larger event 
samples will be available than at the Tevatron. The LHC experiments will thus be able to 
improve the Tevatron results: Given the large event samples, systematic uncertainties related 
to detector calibration can be reduced, and other uncertainties may be improved by a careful 
selection of special subsamples of ti events for the mass measurement. Measurements of the 
top quark pole mass will however always be limited by an intrinsic uncertainty of Aqcd- This 
uncertainty can be overcome in a threshold scan by comparing the measured ti production 
cross section as a function of the ti center-of-mass energy with predictions calculated as a 
function of the top quark mass (where calculations are not done in terms of the pole mass). 
This measurement technique, applied at a future e + e~ collider, will allow for an order of 



113 



magnitude improvement of the uncertainty on the top quark mass measurement. 

Measurements of the top quark mass have already become so precise that interesting con- 
straints can be placed on the Standard Model. It is foreseeable how the precision will further 
improve in the future. The measurement of the top quark mass will remain an important in- 
put for stringent consistency checks of the Standard Model or of models describing potential 
new discoveries. 
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